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(54) Title: ADENO-ASSOCIATED VIRUS VECTORS AND USES THEREOF 
(57) Abstract 



The invention provides an isolated 
and purified DNA molecule comprising at 
least one DNA segment, a biologically ac- 
tive subunit or variant thereof, of a circu- 
lar intermediate of adeno-associated virus, 
which DNA segment confers increased epi- 
somal stability, persistence or abundance of 
the isolated DNA molecule in a host cell. 
The invention also provides a composition 
comprising at least two adeno-associated 
virus vectors. 
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ADENO- ASSOCIATED VIRUS VECTORS AND USES THEREOF 
Cross-Refer ence to Related Applications 

This application is a continuation-in-part, and claims priority of invention 
under 35 U.S.C § 1 19(e), from U.S. application Serial No. 60/086,166, filed May 
5 20, 1998, currently pending, the disclosure of which is incorporated by reference 
herein. 

Background of the Invention 

Adeno-associated virus (AAV) is a non-pathogenic parvovirus with a 
single-stranded DNA genome of 4680 nucleotides. The genome may be of either 

1 0 plus or minus polarity, and codes for two groups of genes, Rep and Cap (Berns 
et al., 1990). Inverted terminal repeats (ITRs), characterized by palindromic 
sequences producing a high degree of secondary structure, are present at both 
ends of the viral genome. While other members of the parvovirus group 
replicate autonomously, AAV requires co-infection with a helper virus (i.e., 

1 5 adenovirus or herpes virus) for lytic phase productive replication. In the absence 
of a helper virus, wild-type AAV (wtAAV) establishes a latent, non-productive 
infection with long-term persistence by integrating into a specific locus on 
chromosome 19, AAVS1, of the host genome through a Rep-facilitated 
mechanism (Samulski, 1993; Linden et al., 1996; Kotin et al., 1992). 

20 In contrast to wtAAV, the mechanism(s) of latent phase persistence of 

recombinant AAV (rAAV) is less clear. rAAV integration into the host genome 
is not site-specific due to deletion of the AAV Rep gene (Ponnazhagan et al., 
1997). Analysis of integrated proviral structures of both wild type and 
recombinant AAV have demonstrated head-to-tail genomes as the predominant 

25 structural forms. 

rAAV has recently been recognized as an extremely attractive vehicle for 
gene delivery (Muzyczka, 1992). rAAV vectors have been developed by 
substituting all viral open reading frames with a therapeutic minigene, while 
retaining the cis elements contained in two inverted terminal repeats (ITRs) 

30 (Samulski et al., 1987; Samulski et al., 1989). Following transduction, rAAV 
genomes can persist as episomes (Flotte et al., 1994; Afione et al., 1996; Duan et 



1 



regard (Xiao et al., 1996; Clark et al., 1997; Fisher et al. 1997). Also, the 
molecular processes and/or structures associated with episomal long-term 
persistence of rAAV genomes, e.g., in nondividing mature myofibers, remains 
unclear. 

Thus, there is a need for rAAV vectors that have increased stability 
and/or persistence in host cells. Moreover, there is a need for vectors useful to 
express large open reading femes. 

The discussion of the background to the invention herein is included to 
explain the context of the invention. This is not to be taken as an admission 
that any of the material referred to was published, known or part of the common 
general knowledge in Australia as at the priority date of any of the claims. 



The present invention provides a recombinant adeno-associated virus 
(rAAV) vector comprising a nucleic acid segment formed by the juxtaposition of 
sequences in the AAV inverted terminal repeats (ITRs) which are present in a 
circular intermediate of AAV. The circular intermediate was isolated from 
rAAV-infected cells by employing a recombinant AAV "shuttle" vector. The 
shuttle vector comprises: a) a bacterial origin of replication; b) a marker gene or 
a selectable gene; c) a 5' ITR; and d) a 3' ITR Preferably, the recombinant 
AAV shuttle vector contains a reporter gene, e.g., a GFP, alkaline phosphatase or 
P-galactosidase gene, a selectable marker gene, e.g., an ampicillin-resistance 
gene, a bacterial origin of replication, a 5' ITR and a 3' ITR- The vector is 
contacted with eukaryotic cells so as to yield transformed eukaryotic cells. Low 
molecular weight DNA ("Hirt DNA") from the transformed eukaryotic cells is 
isolated. Bacterial cells are contacted with the Hirt DNA so as to yield 
transformed bacterial cells. Then bacterial cells are identified which express the 
marker or selectable gene present in the shuttle vector and which comprise at 
least a portion of a circular intermediate of adeno-associated virus. Also, as 
described below, it was found that circularized intermediates of rAAV impart 
episomal persistence to linked sequences in Hela cells, fibroblasts and muscle 
cells. In HeLa cells, the incorporation of certain AAV sequences, e.g., ITRs, 
from circular intermediates into a heterologous plasmid conferred a 10-fold 
increase in the stability of plasmid-based vectors in HeLa cells. Unique features 
of these transduction intermediates included the in vivo circularization of a 
head-to-tail monomer as well as multimer (concatamers) episomal viral genomes 
with associated specific base pair alterations in the 5' viral D-sequence. The 
majority of circular intermediates had a consistent head-to-tail configuration 
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al., 1998), or alternatively can integrate randomly into the cellular genome 
(Berns et al., 1996; McLaughlin et al., 1988; Duan et al., 1997; Fisher-Adams et 
al, 1996; Kearns et al., 1996; Ponnazhagan et al., 1997). However, little is 
known about the mechanisms enabling rAAV vectors to persist in vivo or the 

5 identity of cellular factors which may modulate the efficiency of transduction 
and persistence. Although transduction of rAAV has been demonstrated in vitro 
in cell culture (Muzyczka, 1992) and in vivo in various organs (Kaplitt et al., 
1994; Walsh et al., 1994; Conrad et al., 1996; Herzog et al., 1997; Snyder et al., 
1997), the mechanisms of rAAV-mediated transduction remain unclear. 

1 0 Moreover, while rAAV has been shown to be capable of stable, long- 

term transgene expression both in vitro and in vivo in a variety of tissues, the 
transduction efficiency of rAAV is markedly variable in different cell types. For 
example, rAAV has been reported to transduce lung epithelial cells at low levels 
(Halbert et al., 1997; Duan et al., 1998a), while high level, persistent transgene 

1 5 expression has been demonstrated in muscle, neurons and in other non-dividing 
cells (Kessler et al., 1996; Fisher et al., 1997; Herzog et al., 1997; Xiao et al., 
1996; Kaplitt et al., 1994; Wu et aL, 1998; Ah et al., 1996; Bennett et aL, 1997 
Westfall et al., 1997). These tissue-specific differences in rAAV mediated gene 
transfer may, in part, be due to variable levels of cellular factors affecting AAV 

20 infectivity (i.e., receptors and co-receptors such as heparin sulfate proteoglycan, 
FGFR-1, and aVp5 integrin) (Summerford et al., 1998; Qing et al, 1999; 
Summerford et al., 1999) as well as the latent life cycle (i.e., nuclear trafficking 
of virus and/or the conversion of single stranded genomes to expressible forms) 
(Qing et al., 1997; Qing et al., 1998). 

25 Muscle-mediated gene transfer represents a very promising approach for 

the treatment of hereditary myopathies and several other metabolic disorders. 
Previous studies have demonstrated remarkably efficient and persistent transgene 
expression to skeletal muscle in vivo with rAAV vectors. Applications in this 
model system include the treatment of several inherited disorders such as Factor 

30 IX deficiency in hemophilia B and epo deficiencies (Kessler et al., 1 996; Herzog 
et al., 1997). Although the conversion of low-molecular-weight rAAV genomes 
to high-molecular-weight concatamers has been inferred as evidence for 
integration of proviral DNA in the host genome, no direct evidence exists in this 

2 
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consisting of monomer genomes (<3 kb) which slowly converted to large 
multimers of >12 kb by 80 days post-infection in muscle. Importantly, 
long-term transgene expression was associated with prolonged (80 day) episomal 
persistence of these circular intermediates. Thus, in vivo persistence of rAAV 
5 can occur through episomal circularized genomes which may represent 

prointegration intermediates with increased episomal stability. Moreover, as 
described below, co-infection with adenovirus, at high multiplicities of infection 
(MOI) capable of producing early adenoviral gene products, led to increases in 
the abundance and stability of AAV circular intermediates which correlated with 

10 an elevation in transgene expression from rAAV vectors. Thus, these results 
demonstrate the existence of a molecular structure involved in AAV transduction 
which may play a role in episomal persistence and/or integration. 

Further, these results may aid in the development of non-viral or viral- 
based gene delivery systems having increased efficiency. For example, 

1 5 therapeutic or prophylactic therapies in which the present vectors are useful 
include blood disorders (e.g., sickle cell anemia, thalassemias, hemophilias, and 
Fanconi anemias), neurological disorders, such as Alzheimer's disease and 
Parkinson's disease, and muscle disorders involving skeletal, cardiac or smooth 
muscle. In particular, therapeutic genes useful in the vectors of the invention 

20 include the P-globin gene, the y-globin gene, the cystic fibrosis transmembrane 
conductance receptor gene (CFTR), the Fanconi anemia complementation group, 
a gene encoding a ribozyme, an antisense gene, a low density lipoprotein (LDL) 
gene, a tyrosine hydroxylase gene (Parkinson's disease), a glucocerebrosidase 
gene (Gaucher's disease), an arylsulfatase A gene (metachromatic 

25 leukodystrophies) or genes encoding other polypeptides or proteins. Also within 
the scope of the invention is the inclusion of more than one gene in a vector of 
the invention, i.e., a plurality of genes may be present in an individual vector. 
Further, as a circular intermediate may be a concatamer, each monomer of that 
concatamer may comprise a different gene. 

30 For viral-based delivery systems, helper-free virus can be prepared (see 

WO 95/13365) from circular intermediates or vectors of the invention. 
Alternatively, liposomes, plasmid or virosomes may be employed to deliver a 
vector of the invention to a host or host cell. 
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The increased persistence of circular intermediates or vectors having one 
or a plurality of ITRs may be due to the primary and/or secondary structure of 
the ITRs. The primary structure of a consensus sequence (SEQ ID NO:3) of 
ITRs formed by the juxtaposition and physical (phosphodiester bond) linkage of 

5 ITRs from AAV is shown in Figure 2C. However, as described hereinbelow, 
each ITR sequence may be incomplete, i.e., the ITR may be a subunit or portion 
of the fall length ITRs present in the consensus sequence. Moreover, preferably, 
an isolated DNA segment of the invention is not the 165 bp double DD sequence 
(SEQ ID NO:7) disclosed in U.S. Patent No. 5,478,745, referred to as a "double 

10 sequence'*. 

Moreover, the formation, persistence and/or abundance of molecules 
having the ITR sequences of the invention may be modulated by helper virus, 
e.g., adenoviral proteins and/or host cell proteins. Thus, the circular 
intermediates or vectors of the invention may be useful to identify and/or isolate 

1 5 proteins that bind to the ITR sequences present in those molecules. 

Therefore, the present invention provides an isolated and purified DNA 
molecule comprising at least one DNA segment, a biologically active subunit or 
variant thereof, of a circular intermediate of adeno-associated virus, which DNA 
segment confers increased episomal stability, persistence or abundance of the 

20 isolated DNA molecule in a host cell. Preferably, the DNA molecule comprises 
at least a portion of a left (5') inverted terminal repeat (ITR) of adeno-associated 
virus. Also preferably, the DNA molecule comprises at least a portion of a right 
(3')-inverted terminal repeat of adeno-associated virus. The invention also 
provides a gene transfer vector, comprising: at least one first DNA segment, a 

25 biologically active subunit or variant thereof, of a circular intermediate of adeno- 
associated virus, which DNA segment confers increased episomal stability or 
persistence of the vector in a host cell; and a second DNA segment comprising a 
gene. Preferably, the second DNA segment encodes a therapeutically effective 
polypeptide. The first DNA segment comprises ITR sequences, preferably at 

30 least about 100, more preferably at least about 300, and even more preferably at 
least about 400, bp of adeno-associated virus sequence. A preferred vector of the 
invention is aplasmid. 
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Thus, the vector of the invention is useful in a method of delivering 
and/or expressing a gene in a host cell, to prepare host cells having the vectors), 
and in the preparation of compositions comprising such vectors. To deliver the 
gene to the host cell, a recombinant adenovirus helper virus may be employed. 

5 As described hereinbelow, the tibialis muscle of mice was co-infected 

with rAAV Alkaline phosphatase (Alkphos) and GFP encoding vectors. The 
GFP shuttle vector also encoded ampicillin resistance and a bacterial origin of 
replication to allow for bacterial rescue of circular intermediates in Hirt DNA 
from infected muscle samples. There was a time dependent increase in the 

10 abundance of rescued plasmids encoding both GFP and Alkphos that reached 
33% of the total circular intermediates by 120 days post-infection. Furthermore, 
these large circular concatamers were capable of expressing both GFP and 
Alkphos encoded transgenes following transient transfection in cell lines. Thus, 
concatamerization of AAV genomes in vivo occurs through intermolecular 

15 recombination of independent monomer circular viral genomes. Therefore, a 
plurality of DNA segments, each in an individual rAAV vector, may be 
delivered so as to result in a single DNA molecule having a plurality of the DNA 
segments. For example, one rAAV vector comprises a first DNA segment 
comprising a 5* ITR linked to a second DNA segment comprising a promoter 

20 operably linked to a third DNA segment comprising a first open reading frame 
linked to a fourth DNA segment comprising a 3* ITR. A second rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment 
comprising a second open reading frame linked to a fourth DNA segment 

25 comprising a 3' ITR. 

In another embodiment, one rAAV vector comprises a first DNA 
segment comprising a 5' ITR linked to a second DNA segment comprising a 
promoter operably linked to a third DNA segment comprising the 5' end of an 
open reading frame linked to fourth DNA segment comprising a 5' splice site 

30 linked to a fifth DNA segment comprising a 3' ITR. The second rAAV vector 
comprises a first DNA segment comprising a 5* ITR linked to a second DNA 
segment comprising a 3' splice site linked to a third DNA segment comprising 
the 3' end of the open reading frame linked to a fourth DNA segment comprising 

6 
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a 3' ITR. Preferably, the second and third DNA segments together comprise 
DNA encoding, for example, CTFR, factor VIII, dystrophin, or erythropoietin. 
Also preferably, the second DNA segment comprises the endogenous promoter 
of the respective gene, e.g., the epo promoter. 
5 Thus, the invention provides a composition comprising: a first adeno- 

associated virus vector comprising linked DNA segments and at least a second 
adeno-associated virus comprising linked DNA segments. The linked DNA 
segments of the first vector comprise: a first DNA segment comprising a 5* 
ITR; a second DNA segment comprising at least a portion of an open reading 

1 0 frame operably linked to a promoter, wherein the DNA segment does not 
comprise the entire open reading frame; a third DNA segment comprising a 
splice donor site; and iv) a fourth DNA segment comprising a 3* ITR. The 
linked DNA segments of the second vector comprise a first DNA segment 
comprising a 5* ITR; a second DNA segment comprising a splice acceptor site; a 

1 5 third DNA segment comprising at least a portion of an open reading frame which 
together with the second DNA segment of the first vector encodes a full-length 
polypeptide; and a fourth DNA segment comprising a 3' ITR. Preferably, the 
second DNA segment of the first vector comprises a first exon of a gene 
comprising more than one exon and the third DNA segment of the second vector 

20 comprises at least one exon of a gene that is not the first exon. 

The invention also provides a method to transfer and express a 
polypeptide in a host cell. The method comprises contacting the host cell with at 
least two rAAV vectors. One rAAV vector comprises a first DNA segment 
comprising a 5 TTR linked to a second DNA segment comprising a promoter 

25 operably linked to a third DNA segment comprising a first open reading frame 
linked to a fourth DNA segment comprising a 3' ITR. A second rAAV vector 
comprises a first DNA segment comprising a 5' TTR linked to a second DNA 
segment comprising a promoter operably linked to a third DNA segment 
comprising a second open reading frame linked to a fourth DNA segment 

30 comprising a 3 TTR. Alternatively, one rAAV vector comprises a first DNA 
segment comprising a 5TTR linked to a second DNA segment comprising a 
promoter operably linked to a third DNA segment comprising the 5' end of an 
open reading frame linked to fourth DNA segment comprising a 5' splice site 

7 
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linked to a fifth DNA segment comprising a 3' 1TR. The second rAAV vector 
comprises a first DNA segment comprising a 5' ITR linked to a second DNA 
segment comprising' splice site linked to a third DNA segment compnsmg 
the 3' end of the open reading frame hnked to a fourthDNA segment compnsxng 
a 3 'ITR The host cell is preferably contacted with both of the vectors, 
concurrently, although it is envisioned that the host cell may be contacted wuh 
each vector at a different time relative to the contact with the other vector(s). 

Also provided is a method in which the composition of the invention ,s 
administered to the cells or tissues of an ammal. For example, rAAV vectors 
have shown promise in transferring the CFTR gene into airway epithelial cells of 
animal models and nasal sinus ofCF patient, However, high level express^ 
of CFTR has not been achieved due to the fact that AAV cannot accommodate 
the full-length CFTR gene together with a potent promoter. A number of studres 
have tried to optimize rAAV-mediated CFTR expression by utilizing truncated 
or partially deleted CFTR genes together with stronger promoters. However, it 
is currently unknown what effect deletions within the CFTR gene may have on 
complementation of bacterial colonization defects in the CF airway. Therefore, 
the present invention includes the administration to an animal of a composmon 
of the invention comprising at least two rAAV vectors which together encode 
CTFR The present invention is useful to overcome the current size hnutanon 
for transgenes within rAAV vectors, and allows for the incorporation of a larger 
transciptional regulatory region, e.g., a stronger heterologous promoter or the 
endogenous CFTR promoter. 

Briff Descriplinn oDiieJEiginfis 
Figure 1 . Structure of proviral shuttle vector and the predicted structure 
ofrAAV circular intermediate monomers. With the aid of arAAV Oactmg 
plasmid, pCisAV.GFP3ori (Panel A), AV.GFP3ori recombinant vuus was 
produced (Panel B). This vector encoded a GFP transgene cassette, an 
ampicillin resistance gene (amp), and a bacterial replication origin (on). The 
predominant form of circular btermediates isolated following transduct,on of 
Hela cells with AV.GFP3ori consisted of head-to-tail monomers (Panels C and 
D). 
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Figure 2. Structural analysis of rAAV circular intermediates in Hela 
cells. Circular rAAV intermediate clones isolated from AV.GFP3ori infected 
Hela cells were analyzed by diagnostic restriction digestion with Asel, SphI, and 
PstI together with Southern blotting against ITR, GFP, and Stuffer 32 P-labeled 

5 probes. In panel A, four clones representing the diversity of intermediates found 
(p!90, p333, p280, and p345) gave a diagnostic PstI (P) restriction pattern (3kb 
and 1 .7kb bands) consistent with a circular monomer or multimer intact genome 
[agarose gel (Left) and Southern blot (Right)]. SphI (S) digestion demonstrated 
existence of a single ITR (pl90), two ITRs in a head-to-tail orientation (p333 

10 and p280), and three ITRs (p345) in isolated circular intermediates. The 

restriction pattern of pCisAV.GFP3ori (U; uncut, P; PstI cut, S; SphI cut) and 1 
kb DNA ladder (L) are also given for comparison. One additional circular form 
(p340) was repetitively seen and had an unidentifiable structure which lacked 
intact ITR sequences. Circular concatamers were identified by partial digestion 

1 5 with Asel for clones p280 (dimer) and p333 (monomer) as is shown in Panel B. 
Sequence analysis (Panel C) of six clones with identical restriction patterns to 
p333 (Panel A) was performed using primers (indicated by arrows) juxtaposed to 
the partial p5 promoter (dotted line) and ITRs (solid line). The top sequence 
represents the proposed head-to-tail structure of intact ITR arrays with alignment 

20 of sequence derived from individual clones. The junction of the inverted ITRs is 
marked by inverted arrowheads (at 25 lbp). Several consistent bp changes 
(shaded) were noted in the 5 'ITR D-sequence (boxed) within four clones (p79, 
p81 , p87, and p88). All bp changes are indicated in lower case letters. 

Figure 3. Adenovirus augments AAV circular intermediate formation in 

25 Hela cells. Infection of Hela cells with increasing doses (0, 500, and 5000 
particles/cell) of recombinant El-deleted adenovirus (Ad.CMVlacZ) leads to 
substantial expression of E2a 72kd DNA Binding Protein, as demonstrated by 
indirect immunofluorescent staining for DBP at 72 hours post-infection (Panel 
A). Co-infection of Hela cells with Ad.CMVlacZ (5000 particles/cell) and 

30 AV.GFP3ori ( 1 000 DNA particles/cell) led to substantial augmentation of rAAV 
GFP transgene expression (Panel B). Augmentation in rAAV GFP transgene 
expression in the presence of increasing amounts (0, 500; 5000 and 10000 
particles/cell) of recombinant Ad.CMVlacZ was quantified by FACS analysis at 

9 
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72 hour post-infection (Panel C). Results demonstrate the mean (+/-SEM) for 
two experiments performed in duplicate. In addition, an aliquot of cells was split 
(1:10) at the time of FACS analysis and GFP colony forming units (CFU) per 
10X field were quantified at 6 days (CPE denotes significant cytopathic effects 

5 at an adenoviral MOI of 10,000 particles/cell and was not quantified for GFP 
colonies). Hirt DNAs from AV.GFP3ori (1000 DNA particles/cell) infected 
Hela cells with or without co-infection with Ad.CMVlacZ (5,000 particles/cell) 
were used to transform E. coli. The total number of ampicillin-resistant bacterial 
CFU (Panel D) and total number of head-to-tail circular intermediates CFU 

1 0 (Panel E) are given for a representative experiment. Greater than 20 clones for 
each time point were evaluated by Southern blot (see Figure 2 for detail). Zero 
hour controls were performed by mixing an equivalent amount of AV.GFP3ori 
virus as used in experiments with mock infected cellular lysates prior to Hirt 
purification. Panel F depicts the abundance of head-to-tail circular intermediates 

15 as a percentage of total ampicillin-resistant bacterial CFU isolated from Hirt 
DNA. 

Figure 4. Formation of rAAV head-to-tail circular intermediates 
following in vivo transduction of muscle. The tibialis anterior muscle of 4-5 
week old C57BL/6 mice were infected with AV.GFP3ori (3 X 1010 particles) in 

20 HEPES buffered saline (30 GFP expression (Panel A) was analyzed by 
direct immunofluorescence of freshly excised tissues and/or in formalin-fixed 
cryopreserved tissue sections in four independently injected muscles harvested at 
0, 5, 10, 16, 22 and 80 days post-infection. GFP expression was detected at low 
levels beginning at 10 days and was maximum at 22 days post-infection. 

25 Expression remained stable to 80 days at which time greater than 50% of the 
tissue was positive (see 80 day tissue cross section counter stained with 
propidium iodide, panel A). Hirt DNA was isolated from muscle samples at 
each of the various time points and after points was used to transform E. coli. 
Rescued plasmids (p439, pi 6, pi 7) were analyzed by Southern blotting in Panel 

30 B showing an agarose gel on left and ITR probed blot on right. U:uncut, P:PstI 
cut, and S :SphI cut. The schematic drawing of the most predominant type of 
head-to-tail circular AAV intermediate plasmids rescued from bacteria is given 
in the right of Panel B and shows the structure of pi 7 as an example. Other 

10 
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typical clones included those with less than two ITRs as shown for pi 6. SphI 
digestion of pi 6 and pi 7 plasmids released ITR hybridizing fragments of 
approximately 140 and 300 bp, respectively. The slightly lower mobility then 
predicted for these ITR fragments likely represents anomalous migration due to 

5 the high secondary structure of inverted repeats within ITRs. Sequence analysis 
of pl7 and pl6 using nested primers to 5' and 3'-ITRs also confirmed the ITR 
orientations shown to the right of the gel. Additional restriction enzyme 
analyses to determine this structure included double and single digests with 
SphI, PstI, Asel, and/or Smal. An example of an atypical clone (p439) rescued 

1 0 from bacteria with unknown structure is also shown. 

Figure 5. Frequency of circular intermediate formation in muscle 
following transduction with rAAV. Hirt DNAs isolated from rAAV infected 
tibialis muscle were used to transform E. coli and the rescued plasmids analyzed 
by Southern blotting (greater than 20 clones were analyzed from at least two 

1 5 independent muscle samples for each time point). The averages of total 

head-to-tail circular intermediate clones (line) and ampicillin resistant bacterial 
clones (bar) isolated from each tibialis anterior muscle at 0, 5, 10, 16, 22 and 80 
days post-infection are summarized in Panel A. Only plasmids which contained 
1-2 ITRs were included in the estimation of total head-to-tail circular 

20 intermediates. Plasmids which demonstrated an absence of ITR hybridizing 
SphI fragments (between 150 to 300 bp) were omitted from the calculations. 
Panel B demonstrates the diversity of ITR arrays found in head-to-tail circular 
intermediates at 80 days post-infection. This panel depicts a Southern blot 
probed with ITR sequences and represents circular intermediates with 1-3 ITRs. 

25 SphI fragments which hybridize to ITR probes indicate the size of inverted ITR 
arrays (marked by arrows to right of gel). Additional restriction enzyme analysis 
was used to determine the structure of monomer and multimer circular 
intermediates. Examples are shown for two multimer (pi 36 and p 1 43) circular 
intermediates which contain approximately three AAV genomes. Undigested 

30 plasmids of pi 36 and pl43 migrate greater than 1 2 kb and is contrasted to the 
most predominant form of head-to-tail undigested circular intermediates at 22 
days which migrate at 2.5 kb. The digestion pattern of pi 36 is consistent with a 
uniform head-to-tail configuration of three genomes which is indistinguishable 
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from digestion patterns of pi 39 which contains one circularized genome 
(undigested pi 39 migrates at 2.5 kb, data not shown, also see examples pi 7 in 
Figure 4). In contrast, pl36 depicts a more complex head-to-tail multimer 
circular intermediate which has various deletions and duplications within the 

5 ITR arrays. Predicted structure of five representative intermediates is 
schematically shown in Panel C. 

Figure 6. Molecular size of circular intermediates in muscle. Hirt DNA 
from AV.GFP3ori infected muscle was size fractionated by electrophoresis and 
various molecular weight fractions transformed into E. colL Results demonstrate 

1 0 the abundance of circular intermediates at each of the given molecular weights at 
22 and 80 days post-infection with the rAAV shuttle vector. Structure of circular 
intermediates were confirmed by Southern blot restriction analysis. 

Figure 7. Head-to-tail circular intermediates demonstrate increased 
stability of GFP expression following transient transfection in Hela cells. 

1 5 Subconfluent monolayers of Hela cells were co-transfected with p8 1 , p87, or 
pCMVGFP and pRSVlacZ as an internal control for transfection efficiency as 
described in the methods. Panel A demonstrates the expansion of GFP clones 
after one passage (arrows). Quantification of clone size and numbers are shown 
in Panel B. Clone size represents the mean raw values while clone numbers are 

20 normalized for transfection efficiency as determined by X-gal staining for 

pRSVlacZ. The data at the top of bar graph values for each construct in Panel B 
represents quantification of GFP clones after second passage (also normalized 
for transfection efficiency). Results indicate the mean (+/-SEM) of duplicate 
experiments with greater than 20 fields quantified for each experimental point. 

25 The persistence of transfected p8 1 and pCMVGFP plasmid DNA at passage-7 
post-transfection was evaluated by genomic Southern blot of total cellular DNA 
hybridized against 32 P-labeled GFP probe (Panel C, results from two independent 
transfections are shown). Uruncut, C:PstI cut. The migration of uncut dimer and 
monomer plasmids forms are marked on the left PstI digestion of the plasmids 

30 results in bands at 4.7 kb (pCMVGFP, single PstI site in plasmid) and 1 .7 kb 
(p81, two PstI sites flanking the GFP gene). To determine whether the 
head-to-tail ITR array within circular intermediates was responsible for increases 
in the persistence of GFP expression, the head-to-tail ITR DNA element was 
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subcloned into the pGL3 luciferase plasmid to generate pGL3(ITR). Results in 
Panel D compare the extent of luciferase transgene expression following 
transfection with pGL3 and pGL3(ITR) at 10 days (passage-2) post-transfection. 
Results are the mean (+/-SEM) for triplicate experiments and are normalized for 
5 transfection efficiency using a dual renilla luciferase reporter vector (pRLSV40, 
Promega). 

Figure 8. Identification of adenoviral genes responsible for augmentation 
of AAV circular intermediate formation. Hela cells were infected with 
AV.GFP3ori (1000 DNA particles/cell) in the presence of wtAd5, rf/802 (E2a- 

1 0 deleted), and dl 1 004 (E4-deleted) adenovirus (at the indicated MOIs). Total 
number of head-to-tail circular intermediates from Hirt DNA and the level of 
augmentation of GFP transgene expression (as determined by FACS) was 
quantified at 24 hours post-infection (Panel A). Results are the average of 
duplicate experiments. Panel B depicts results from Southern blot analysis of 

1 5 Hirt DNA following hybridization to a GFP P 32 -labeled probe. DNA loads were 
10% of the total Hirt yield from a 35 mm plate of Hela cells. Infections were 
carried out identically to that described for Panel A. Arrows mark replication 
form concatamers (RJQ, diraers (RQ, monomers (RfJ, and single-stranded 
AAV genomes (ssDNA). 

20 Figure 9. Model for independent mechanistic interactions of adenovirus 

with lytic and latent phase aspects of the AAV life cycle. The adenoviral E4 
gene has been shown to augment the level of rAAV second strand synthesis 
giving rise to replication form dimers (RQ and monomers (RfJ (Figure 8B). 
This augmentation leads to substantial increases in transgene expression from 

25 rAAV vectors and most closely mirrors lytic phase replication of wtAAV as 

head-to-head and tail-to-tail concatamers. In contrast, E4 expression inhibits the 
formation of head-to-tail circular intermediates of AAV. Hence, it appears that 
increases in the amount of Ri^ and Rf^ double stranded DNA genomes does not 
increase the extent of circular intermediate formation. Such findings suggest that 

30 conversion of R^ and Rf d to circular intermediates does not likely occur and 
implicates two mechanistically distinct pathway for their formation. In support 
of this hypothesis, adenoviral E2a gene expression does not enhance the 
formation ofRf^ and Rf d genomes but rather increase the abundance and/or 
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stability of head-to-tail circular intermediates. Furthermore, in the absence of 
E4, E2a gene expression does not lead to augmentation of rAAV transgene 
expression. Since circular intermediates have increased episomal stability in 
muscle and in Hela cells, this molecular structure may be important in the latent 

5 phase of AAV persistence. Alternatively, these circular intermediates may 
represent pre-integration complexes as previously hypothesized for Rep 
facilitated integration. In the absence of Rep, circular intermediates may 
accumulate episomally in rAAV infected cells. In summary, these findings 
support the notion that adenovirus may modulate both latent and lytic aspects of 

10 the AAV life cycle. 

Figure 10. Individual chemical sequence of SphI fragments from p81 (A; 
SEQ ID NO:4), p79 (B; SEQ ID NO:5), and pl202 (C; SEQ ID NO:6) AAV 
circular intermediates. The ends of the sequence (underlined) represent SphI 
restriction enzyme sites within head-to-tail circular AAV genomes cloned with 

1 5 the AV-GFP3ori shuttle virus. 

Figure 1 1 . Chemical sequence homology of three AAV circular 
intermediates with various conformations of ITR arrays. Diversity in ITR arrays 
are evident from the non-conserved bases marked in lower case. The ends of the 
sequence (underlined) represent SphI restriction enzyme sites within head-to-tail 

20 circular AAV genomes cloned with the AV.GFP3ori shuttle virus. 

Figure I2A. Palindromic repeat structure derived from chemical 
sequencing of AAV circular intermediate isolate p81 . Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 
Figure 12B. Palindromic repeat structure derived from chemical 

25 sequencing of AAV circular intermediate isolate p79. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 

Figure 12C. Palindromic repeat structure derived from chemical 
sequencing of AAV circular intermediate isolate p79. Secondary structure of the 
sense strand is depicted in the top box with plasmid reference given below. 

30 Figure 13. Persistence of GFP expression in developing Xenopus 

embryos microinjected with AAV circular intermediate isolate p8 1 . The extent 
of GFP fluorescence in tadpoles reflects the stability of episomal or integrated 
microinjected plasmids. Bright field image on the left is of the p81 injected 
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embryo. The p81 injected embryo depicts fluorescence in nearly all cells by one 
week post-injection. In contrast, a mosaic pattern of expression in a minority of 
cells in pCisAV.GFPori injected embryos. The pCisAV.GFPori plasmid 
contains the identical promoter sequences driving GFP gene expression and two 

5 ITRs separated by stuffer sequence. These findings demonstrate that specific 
structural characteristics found within AAV circular intermediates are 
responsible for increased persistence of transgene expression. 

Figure 14. Mechanistic scheme for determining pathways for rAAV 
circular concatamer formation. The two independent vectors used in these 

10 studies, AV.Alkphos and AV.GFP3.ori, are shown in Panel A . Restriction sites 
important in the structural analysis of circular intermediates are also shown. In 
Panel B, a schematic representation of two potential models for circular 
concatamer formation is depicted, along with the methods to experimentally 
differentiate which of these processes is active in muscle. Following co- 

15 infection of the tibialis muscle with AV.Alkphos and AV.GFP3.ori, all 
subsequently rescued plasmids arise solely from circular intermediates 
containing AV.GFP3ori genomes. If rolling circular replication is the sole 
mechanism of concatamerization, only GFP expressing plasmids should be 
rescued. In contrast, if intermolecular recombination between independently 

20 formed monomer circular intermediates is the mechanism of concatamerization, 
both GFP and GFP/Alkphos expressing plasmids should be rescued. 

Figure 1 5. Co-infection of tibialis muscle of mice with AV.Alkphos and 
AV.GFP3ori. Transgene expression of rAAV infected tibialis muscle was . 
determined at 14, 35, 80 (Panels A and A'), and 120 (Panels B-D) days 

25 following co-infection with 5 x 10 9 DNA particles each of AVAlkphos and 
AV.GFP3ori. The time course of transgene expression started around 14 days 
and peaked by 35-80 days. The extent of co-infection of myofibers with both 
Alkphos and GFP rAAV was determined in serial sections of 80 and 120 day 
post-infection muscle samples. Panels A-C represent GFP fluorescence of 

30 formalin fixed, cryoprotected sections, while panels A'-C depict the 

histochemical staining for Alkaline phosphatase in adjacent serial sections. A 
short staining time (7 minutes) was necessary to observe variation in staining 
levels for comparison to GFP. It was found that longer staining times 
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(30 minutes) saturated the Alkphos signal. The boxed region in panels B and B' 
are enlarged in panels C and C, respectively. A more precise correlation of GFP 
and Alkphos staining in myofibers is given in Panel D in which co-localization 
of GFP and Alkphos expression was examined in the same section of a 120 day 

5 post-infected sample. This was performed by photographing the GFP 

fluorescent image prior to staining for Alkphos activity. The left panel of D 
shows a high power Nomarski photomicrograph of a group of myofibers (traced 
in red) , while the corresponding GFP and Alkphos staining patterns are shown 
in the right panel. Photomicrographs of Alkphos staining were taken with a red 

10 filter to allow for superimposition of staining patterns with GFP fluorescence. 
Co-expression of Alkphos and GFP is shown within myofibers as a 
yellow/orange color. Myofibers are marked as follows: (-) negative for both 
Alkphos and GFP, (*) positive for only GFP, and (+) positive for both GFP and 
Alkphos. 

15 Figure 16. Rescue of circular intermediates and characterization of DNA 

hybridization patterns. Using the ampicillin resistance gene (amp) and bacterial 
ori incorporated into the AV.GFP3ori vector, the extent of circular intermediate 
formation was assessed by rescuing amp resistant plasmids following 
transformation of 1/5 the isolated Hirt DNA into E. coli Sure cells. Twenty 

20 plasmids from each muscle sample were prepared and analyzed by slot blot 
hybridization against GFP, Alkphos, and Amp 32 P-labeled DNA probes. A 
representative group demonstrating the hybridization patterns is shown in Panel 
A. Panel B depicts the mean (+/-SEM) number of rescued bacterial plasmids 
that hybridized to either GPF alone, or to both GFP and Alkphos probes, 

25 following transformation of 1/5* of the Hirt DNA. These numbers were 

calculated from the percentage of plasmids hybridizing to GPF and/or Alkphos 
and the total GFU plating efficiency derived from the original transformation. In 
total, 3 independent muscle samples were analyzed for a total of 60 plasmids at 
each time point. The percentage of GFP hybridization positive rescued plasmids 

30 that also demonstrated hybridization to Alkphos is shown in Panel C. These data 
demonstrate an increase in the abundance of rescued GFP/Alkphos co-encoding 
circular intermediates over time. 
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Figure 17. Transgene expression from rescued circular intermediates. 
Rescued circular intermediate plasmids were transfected into 293 cells for 
assessment of their ability to express encoded transgenes. In these studies all 
GFP hybridization positive clones from at least two muscles were tested for each 
5 time point and scored for their ability to express GFP and Alkaline phosphatase. 
In total at least 40 clones were evaluated for each time point Three patterns of 
transgene expression were observed following transfection of these plasmids: I) 
no gene expression (Panel A), II) GFP expression only (Panel B), and IH) GFP 
and Alkphos expression (Panel C). Panels A-C depict Nomarski 
10 photomicrographs (left) of GFP fluorescent fields (center) and Alkphos staining 
of a different field from the same culture (right). The percentage of GFP 
hybridization positive clones that also expressed GFP is shown in Panel D. 
Additionally, this panel illustrates the percentage of GFP expressing clones also 
expressing Alkphos. 
15 Figure 18. Structural analysis of bi-functional concatamer circular 

intermediates. To fully characterize the nature of GFP and Alkphos co- 
expressing circular intermediates, detailed structural analyses were performed 
using restriction enzyme mapping and Southern blot hybridization with GFP, 
Alkphos, and ITR 32 P-labeled probes. Results from Southern blot analysis of 
20 plasmid clone #33 (Panel A) and clone #5 (Panel C) are given as representative 
examples of circular intermediates isolated from 80 and 35 day Hirt DNA of 
rAAV infected muscle, respectively. Agarose gels were run in triplicate for each 
of these clones and Southern blot filters were hybridized with one of the three 
DNA probes as indicated below each autoradiogram. Molecular weights (kb) are 
25 indicated to the left of the ethidium stained agarose gel and restriction enzymes 
are marked on the top of each gel/filter. Panels B and D give the deduced 
structure of plasmid clones #33 and #5, respectively, as based on Southern blot 
analysis. For ease of comparison with the restriction maps of the viral genomes 
given in Figure 14 A, the position of restriction enzyme sites (kb) are marked 
30 with the indicated orientation of intact viral genomes. However, in clone #33 a 
deletion occurred between the Asel and Hindm site of a head-to-tail array 
between AV.Alkphos and AV.GFP3ori, as reflected by a 900 bp reduction in the 
anticipated size of HindUI/Notl and Clal/Asel fragments (marked by asterisks in 
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Panel A). Furthermore, the SphI site flanking an ITR was ablated in clone #5 
(bands effected by this deletion are marked by asterisks in Panel C). The 
deletion is not reflected in the overall concatamer since the exact region involved 
and/or the size of the deletion is unclear. Additionally, chemical sequence 

5 evidence of rescued circular intermediates suggests that the predominant form of 
ITR arrays may be in a double-D structure (ie., one ITR flanked by two D- 
sequence rather than two ITRs) and hence ITR arrays containing fragments may 
appear 147 bp shorter than indicated. However, to more easily depict the 
orientation of viral genomes, the position of 5 'and 3' ITRs is indicated rather 

1 0 than representing a single ITR at these junctions. 

Figure 19. Application of rAAV circular concatamers to deliver trans- 
splicing vectors with large gene inserts. Panel A depicts two rAAV vectors 
encoding two halves of a cDNA (red) and flanked by splice site consensus 
sequences (brown). Panel B depicts one potential type of intermolecular 

1 5 concatamer following co-infection of cells with the independent vectors shown 
in panel A. Full length transgene mRNA can then be produced by splicing 
between these two vector encoded sequences within circular concatamers. 
Detailed Description of the Invention 

Definitions 

20 As used herein, the terms "isolated and/or purified" refer to in vitro 

preparation, isolation and/or purification of a nucleic acid molecule of the 
invention, so that it is not associated with in vivo substances. 

As used herein, a DNA molecule, sequence or segment of the invention 
preferably is biologically active. A biologically active DNA molecule of the 

25 invention has at least about 1%, more preferably at least about 10%, and more 
preferably at least about 50%, of the activity of a DNA molecule comprising ITR 
sequences from a circular intermediate of AAV, e.g., a DNA molecule 
comprising SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, or a 
subunit or variant thereof. The activity of a nucleic acid molecule of the 

30 invention can be measured by methods well known to the art, some of which are 
described hereinbelow. For example, the presence of the DNA molecule in a 
recombinant nucleic acid molecule in a host cell results in episomal persistence 
and/or increased abundance of the recombinant molecule in those cells relative to 
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corresponding cells having a recombinant nucleic acid molecule lacking a DNA 
molecule of the invention. 

A variant DNA molecule, sequence or segment of the invention has at 
least about 70%, preferably at least about 80%, and more preferably at least 

5 about 90%, but less than 1 00%, contiguous nucleotide sequence homology or 
identity to a DNA molecule comprising ITR sequences from a circular 
intermediate of AAV, e.g., SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ 
ID NO:6, a subunit thereof. A variant DNA molecule of the invention may 
include nucleotide bases not present in SEQ ID NO:3, SEQ ID NO:4, SEQ ID 

10 NO:5, SEQ ID NO:6, e.g., 5', 3' or internal deletions or insertions, such as the 
insertion of a restriction endonuclease recognition site, so long as these bases do 
not substantially reduce the biological activity of the molecule. A substantial 
reduction in activity means a reduction in activity of greater than about 50%, 
preferably greater than about 90%. 

15 I. Identificatinn nf Nucleic Arid Molecules Falling Within the Scone o f the 
Invention 

A Nucleic Acid Molec ules of the Invention 

1 . Sources of the Nucleic Aci H Molecules of the Invention 

Sources of nucleotide sequences from which the present nucleic acid 

20 molecules can be obtained include AAV infected cells, e.g., any vertebrate, 
preferably mammalian, cellular source. 

As used herein, the terms "isolated and/or purified" refer to in vitro 
isolation of a nucleic acid, e.g., DNA molecule from its natural cellular 
environment, and from association with other components of the cell, such as 

25 nucleic acid or polypeptide, so that it can be sequenced, replicated, and/or 
expressed. For example, "isolated nucleic acid" is UNA or DNA containing 
greater than about 50, preferably about 300, and more preferably about 500 or 
more, sequential nucleotide bases that comprise a DNA segment from a circular 
intermediate of AAV which contains at least a portion of the 5' and 3' URs and 

30 the D sequence, or a variant thereof, that is complementary or hybridizes, 
respectively, to AAV ITR DNA and remains stably bound under stringent 
conditions, as defined by methods well known in the art, e.g., in Sambrook et al., 
1989. Thus, the RNA or DNA is "isolated" in that it is free from at least one 
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contaminating nucleic acid with which it is normally associated in the natural 
source of the RNA or DNA and is preferably substantially free of any other 
mammalian RNA or DNA. The phrase "free from at least one contaminating 
source nucleic acid with which it is normally associated" includes the case where 
5 the nucleic acid is reintroduced into the source or natural cell but is in a different 
chromosomal location or is otherwise flanked by nucleic acid sequences not 
normally found in the source cell, e.g., in a vector or plasmid. An example of 
isolated nucleic acid within the scope of the invention is nucleic acid that shares 
at least about 80%, preferably at least about 90%, and more preferably at least 
10 about 95%, sequence identity with SEQ ED NO:3, SEQ ID NO:4, SEQ ID NO:5 
or SEQ ID NO:6, or a subunit thereof. 

As used herein, the term "recombinant nucleic acid" or "preselected 
nucleic acid " e.g., "recombinant DNA sequence or segment" or "preselected 
DNA sequence or segment" refers to a nucleic acid, e.g., to DNA, that has been 
1 5 derived or isolated from any appropriate cellular source, that may be 

subsequently chemically altered in vitro y so that its sequence is not naturally 
occurring, or corresponds to naturally occurring sequences that are not 
positioned as they would be positioned in a genome which has not been 
transformed with exogenous DNA. An example of preselected DNA "derived" 
20 from a source, would be a DNA sequence that is identified as a useful fragment 
within a given organism, and which is then chemically synthesized in essentially 
pure form. An example of such DNA "isolated" from a source would be a useful 
DNA sequence that is excised or removed from said source by chemical means, 
e.g., by the use of restriction endonucleases, so that it can be further 
25 manipulated, e.g., amplified, for use in the invention, by the methodology of 
genetic engineering. 

Thus, recovery or isolation of a given fragment of DNA from a restriction 
digest can employ separation of the digest on polyacrylamide or agarose gel by 
electrophoresis, identification of the fragment of interest by comparison of its 
30 mobility versus that of marker DNA fragments of known molecular weight, 

removal of the gel section containing the desired fragment, and separation of the 
gel from DNA. See Lawn et al., Nuclfiic A "H* Re * 2, 6103 (1981), and 
Goeddel et al., Nucleic Adds Res. T fi, 4057 (1980). Therefore, ^preselected 
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DNA" includes completely synthetic DNA sequences, semi-synthetic DNA 
sequences, DNA sequences isolated from biological sources, and DNA 
sequences derived from RNA, as well as mixtures thereof. 

Nucleic acid molecules having base pair substitutions (i.e., variants) are 

5 prepared by a variety of methods known in the art. These methods include, but 
are not limited to, isolation from a natural source (in the case of naturally 
occurring sequence variants) or preparation by oligonucleotide-mediated (or site- 
directed) mutagenesis, PCR mutagenesis, and cassette mutagenesis of an earlier 
prepared variant or a non-variant version of the nucleic acid molecule. 

1 0 Oligonucleotide-mediated mutagenesis is a preferred method for 

preparing substitution variants. This technique is well known in the art as 
described by Adelman et al., DNA, 2, 183 (1983). Briefly, AAV DNA is altered 
by hybridizing an oligonucleotide encoding the desired mutation to a DNA 
template, where the template is the single-stranded form of a plasmid or 

1 5 bacteriophage containing the unaltered or native DNA sequence of AAV. After 
hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template that will thus incorporate the 
oligonucleotide primer, and will code for the selected alteration in the AAV 
DNA. 

20 Generally, oligonucleotides of at least 25 nucleotides in length are used. 

An optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the 
single-stranded DNA template molecule. The oligonucleotides are readily 

25 synthesized using techniques known in the ait such as that described by Crea et 
al., TW Natl Acad. ScL U.S.A. . 25, 5765 (1978). 

The DNA template can be generated by those vectors that are either 
derived from bacteriophage M13 vectors (the commercially available M13mpl8 
and M13mpl9 vectors are suitable), or those vectors that contain a single- 

30 stranded phage origin of replication as described by Viera et al, Mfith Enzymc l., 
151, 3 (1987). Thus, the DNA that is to be mutated may be inserted into one of 
these vectors to generate single-stranded template. Production of the single- 
stranded template is described in Sections .4.21-4.41 of Sambrook et al., 
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Molecular CtrmiTig- A T ,ahoratory Manual (Cold Spring Harbor Laboratory 
Press, N.Y. 1989). 

Alternatively, single-stranded DNA template may be generated by 
denaturing double-stranded plasmid (or other) DNA using standard techniques. 
5 For alteration of the native DNA sequence (to generate amino acid 

sequence variants, for example), the oligonucleotide is hybridized to the single- 
stranded template under suitable hybridization conditions. A DNA polymerizing 
enzyme, usually the Klenow fragment of DNA polymerase I, is then added to 
synthesize the complementary strand of the template using the oligonucleotide as 

10 a primer for synthesis. A heteroduplex molecule is thus formed such that one 
strand of DNA encodes the mutated form of AAV, and the other strand (the 
original template) encodes the native, unaltered sequence of AAV. This 
heteroduplex molecule is then transformed into a suitable host cell, usually a 
prokaryote such as E. coli JM101 . After the cells are grown, they are plated onto 

1 5 agarose plates and screened using the oligonucleotide primer radiolabeled with 
32-phosphate to identify the bacterial colonies that contain the mutated DNA. 
The mutated region is then removed and placed in an appropriate vector, 
generally an expression vector of the type typically employed for transformation 
of an appropriate host. 

20 The method described immediately above may be modified such that a 

homoduplex molecule is created wherein both strands of the plasmid contain the 
mutations(s). The modifications are as follows: The single-stranded 
oligonucleotide is annealed to the single-stranded template as described above. 
A mixture of three deoxyribonucleotides, deoxyriboadenosine (dATP), 

25 deoxyriboguanosine (dGTP), and deoxyribothymidine (dTTP), is combined with 
a modified thiodeoxyribocytosine called dCTP-(aS) (which can be obtained 
from the Amersham Corporation). This mixture is added to the template- 
oligonucleotide complex. Upon addition of DNA polymerase to this mixture, a 
strand of DNA identical to the template except for the mutated bases is 

30 generated. In addition, this new strand of DNA will contain dCTP-(aS) instead 
of dCTP, which serves to protect it from restriction endonuclease digestion. 

After the template strand of the double-stranded heteroduplex is nicked 
with an appropriate restriction enzyme, the template strand can be digested with 

22 



WO 99/60146 



PCT/US99/11197 



ExoIII nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that 
is only partially single-stranded. A complete double-stranded DNA homoduplex 
is then formed using DNA polymerase in the presence of all four 
5 deoxyribonucleotide triphosphates, ATP, and DNA ligase. This homoduplex 
molecule can then be transformed into a suitable host cell such as E. coli JM101 . 

For example, a preferred embodiment of the invention is an isolated and 
purified DNA molecule comprising a DNA segment comprising SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, a subunit thereof or a variant 

1 0 thereof having nucleotide substitutions, or deletions or insertions. 

II Preparahon of Molecules Useful to Practice the Methods of the I nvention 

A. Nucleic Acid Molecules 

1 , Chimeric Expression Cassettes 

To prepare expression cassettes for transformation herein, the 

1 5 recombinant or preselected DNA sequence or segment may be circular or linear, 
double-stranded or single-stranded. Generally, the preselected DNA sequence or 
segment is in the form of chimeric DNA, such as plasmid DNA, that can also 
contain coding regions flanked by control sequences which promote the 
expression of the preselected DNA present in the resultant cell line. 

20 As used herein, "chimeric" means that a vector comprises DNA from at 

least two different species, or comprises DNA from the same species, which is 
linked or associated in a manner which does not occur in the "native" or wild 
type of the species. 

Aside from the preselected DNA sequences described above, a portion of 

25 the preselected DNA may serve a regulatory or a structural function. For 

example, the preselected DNA may itself comprise a promoter that is active in 
mammalian cells, or may utilize a promoter already present in the genome that is 
the transformation target. Such promoters include the CMV promoter, as well as 
the SV40 late promoter and retroviral LTRs (long terminal repeat elements), 

30 although many other promoter elements well known to the art may be employed 
in the practice of the invention. 

Other elements functional in the host cells, such as introns, enhancers, 
polyadenylation sequences and the like, may also be a part of the preselected 
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DNA. Such elements may or may not be necessary for the function of the DNA, 
but may provide improved expression of the DNA by affecting transcription, 
stability of the mRNA, or the like. Such elements may be included in the DNA 
as desired to obtain the optimal performance of the transforming DNA in the 
5 cell. 

"Control sequences" is defined to mean DNA sequences necessary for 
the expression of an operably linked coding sequence in a particular host 
organism. The control sequences that are suitable for prokaryotic cells, for 
example, include a promoter, and optionally an operator sequence, and a 

10 ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

"Operably linked" is defined to mean that the nucleic acids are placed in 
a functional relationship with another nucleic acid sequence. For example, DNA 
for a presequence or secretory leader is operably linked to DNA for a peptide or 

15 polypeptide if it is expressed as a preprotein that participates in the secretion of 
the peptide or polypeptide; a promoter or enhancer is operably linked to a coding 
sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being 

20 linked are contiguous and, in the case of a secretory leader, contiguous and in 
reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not 
exist, the synthetic oligonucleotide adaptors or linkers are used in accord with 
conventional practice. 

25 The preselected DNA to be introduced into the cells further will generally 

contain either a selectable marker gene or a reporter gene or both to facilitate 
identification and selection of transformed cells from the population of cells 
sought to be transformed. Alternatively, the selectable marker may be carried on 
a separate piece of DNA and used in a co-transformation procedure. Both 

30 selectable markers and reporter genes may be flanked with appropriate 

regulatory sequences to enable expression in the host cells. Useful selectable 
markers are well known in the art and include, for example, antibiotic and 
herbicide-resistance genes, such as mo, hpt, dhfr, bar, aroA, dapA and the like. 
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See also, the genes listed on Table 1 of Lundquist et al. (U.S. Patent No. 
5,848,956). 

Reporter genes are used for identifying potentially transformed cells and 
for evaluating the functionality of regulatory sequences. Reporter genes which 

5 encode for easily assayable proteins are well known in the art. In general, a 
reporter gene is a gene which is not present in or expressed by the recipient 
organism or tissue and which encodes a protein whose expression is manifested 
by some easily detectable property, e.g., enzymatic activity. Preferred genes 
include the chloramphenicol acetyl transferase gene (cat) from Tn9 of E. coli, the 

10 beta-glucuronidase gene (gus) of the uidA locus of E. colU and the luciferase 
gene from firefly Photinus pyralis. Expression of the reporter gene is assayed at 
a suitable time after the DNA has been introduced into the recipient cells. 

The general methods for constructing recombinant DNA which can 
transform target cells are well known to those skilled in the art, and the same 

1 5 compositions and methods of construction may be utilized to produce the DNA 
useful herein. For example, J. Sambrook et al., Molecular Honing: A 
T^hnratnry Manual . Cold Spring Harbor Laboratory Press (2d ed., 1989), 
provides suitable methods of construction. 
7 Transformation into Host Cells 

20 The recombinant DNA can be readily introduced into the host cells, e.g., 

mammalian, bacterial, yeast or insect cells by transfection with an expression 
vector of the invention, by any procedure useful for the introduction into a 
particular cell, e.g., physical or biological methods, to yield a transformed cell 
having the recombinant DNA stably integrated into its genome or present as an 

25 episome which can persist in the transformed cells, so that the DNA molecules, 
sequences, or segments, of the present invention are maintained and/or expressed 
by the host cell. 

Physical methods to introduce a preselected DNA into a host cell include 
calcium phosphate precipitation, lipofection, particle bombardment, 
30 microinjection, electroporation, and the like. Biological methods to introduce 
the DNA of interest into a host cell include the use of DNA and RNA viral 
vectors. The main advantage of physical methods is that they are not associated 
with pathological or oncogenic processes of viruses. However, they are less 
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precise, often resulting in multiple copy insertions, random integration, 
disruption of foreign and endogenous gene sequences, and unpredictable 
expression. 

As used herein, the term "cell line" or "host cell" is intended to refer to 
5 well-characterized homogenous, biologically pure populations of cells. These 
cells may be eukaryotic cells that are neoplastic or which have been 
"immortalized" in vitro by methods known in the art, as well as primary cells, or 
prokaryotic cells. The cell line or host cell is preferably of mammalian origin, 
but cell lines or host cells of non-mammalian origin may be employed, including 
10 plant, insect, yeast, fiingal or bacterial sources. Generally, the preselected DNA 
sequence is related to a DNA sequence which is resident in the genome of the 
host cell but is not expressed, or not highly expressed, or, alternatively, 
overexpressed. 

"Transfected" or "transformed" is used herein to include any host cell or 
1 5 cell line, the genome of which has been altered or augmented by the presence of 
at least one preselected DNA sequence, which DNA is also referred to in the art 
of genetic engineering as "heterologous DNA," "recombinant DNA," 
"exogenous DNA," "genetically engineered," "non-native," or "foreign DNA," 
wherein said DNA was isolated and introduced into the genome of the host cell 
20 or cell line by the process of genetic engineering. The host cells of the present 
invention are typically produced by transfection with a DNA sequence in a 
plasmid expression vector, a viral expression vector, or as an isolated linear 
DNA sequence. 

To confirm the presence of the preselected DNA sequence in the host 
25 cell, a variety of assays may be performed. Such assays include, for example, 
"molecular biological" assays well known to those of skill in the art, such as 
Southern and Northern blotting, RT-PCR and PCR; "biochemical" assays, such 
as detecting the presence of a polypeptide expressed from a gene present in the 
vector, e.g., by immunological means (immunoprecipitations, immunoaffinity 
30 columns, ELISAs and Western blots) or by any other assay useful to identify 
molecules falling within the scope of the invention. 

To detect and quantitate RNA produced from introduced DNA segments, 
RT-PCR may be employed. In this application of PCR, it is first necessary to 
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reverse transcribe RNA into DNA, using enzymes such as reverse transcriptase, 
and then through the use of conventional PCR techniques amplify the DNA. In 
most instances PCR techniques, while useful, will not demonstrate integrity of 
the RNA product. Further information about the nature of the RNA product may 
5 be obtained by Northern blotting. This technique demonstrates the presence of 
an RNA species and gives information about the integrity of that RNA. The 
presence or absence of an RNA species can also be determined using dot or slot 
blot Northern hybridizations. These techniques are modifications of Northern 
blotting and only demonstrate the presence or absence of an RNA species. 

1 0 While Southern blotting and PCR may be used to detect the DNA 

segment in question, they do not provide information as to whether the DNA 
segment is being expressed. Expression may be evaluated by specifically 
identifying the polypeptide products of the introduced DNA sequences or 
evaluating the phenotypic changes brought about by the expression of the 

15 introduced DNA segment in the host cell 

ELL Dosages, Form ulations and Routes of Administration 

Administration of a nucleic acid molecule may be accomplished through 
the introduction of cells transformed with the nucleic acid molecule (see, for 
example, WO 93/02556), the administration of the nucleic acid molecule itself 

20 (see, for example, Feigner et al., U.S. Patent No. 5,580,859, Pardoll et al., 
Immunity 2, 165 (1995); Stevenson et al., Immunol. Rev., 145, 211 (1995); 
Moiling, J. MoL Med. . 25, 242 (1997); Donnelly et al., Ann, N.Y. Acad. Sci., 
222, 40 (1995); Yang et aL, Mol Med Today. 2, 476 (1996); Abdallah et al., 
Biol Hell , £5, 1 (1995)), through infection with a recombinant virus or via 

25 liposomes. Pharmaceutical formulations, dosages and routes of administration 
for nucleic acids are generally disclosed, for example, in Feigner et al., supra. 

Administration of the therapeutic agents in accordance with the present 
invention may be continuous or intermittent, depending, for example, upon the 
recipient's physiological condition, whether the purpose of the administration is 

30 therapeutic or prophylactic, and other factors known to skilled practitioners. The 
administration of the agents of the invention may be essentially continuous over 
a preselected period of time or may be in a series of spaced doses. Both local 
and systemic administration is contemplated. When the molecules of the 
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invention are employed for prophylactic purposes, agents of the invention are 
amenable to chronic use, preferably by systemic administration. 

One or more suitable unit dosage forms comprising the therapeutic agents 
of the invention, which, as discussed below, may optionally be formulated for 
5 sustained release, can be administered by a variety of routes including oral, or 
parenteral, including by rectal, transdermal, subcutaneous, intravenous, 
intramuscular, intraperitoneal, intrathoracic, intrapulmonary and intranasal 
routes. The formulations may, where appropriate, be conveniently presented in 
discrete unit dosage forms and may be prepared by any of the methods well 

1 0 known to pharmacy. Such methods may include the step of bringing into 

association the therapeutic agent with liquid carriers, solid matrices, semi-solid 
carriers, finely divided solid carriers or combinations thereof, and then, if 
necessary, introducing or shaping the product into the desired delivery system. 
When the therapeutic agents of the invention are prepared for oral 

1 5 administration, they are preferably combined with a pharmaceutically acceptable 
carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage 
form. The total active ingredients in such formulations comprise from 0.1 to 
99.9% by weight of the formulation. By "pharmaceutically acceptable" it is 
meant the carrier, diluent, excipient, and/or salt must be compatible with the 

20 other ingredients of the formulation, and not deleterious to the recipient thereof. 
The active ingredient for oral administration may be present as a powder or as 
granules; as a solution, a suspension or an emulsion; or in achievable base such 
as a synthetic resin for ingestion of the active ingredients from a chewing gum. 
The active ingredient may also be presented as a bolus, electuary or paste. 

25 Pharmaceutical formulations containing the therapeutic agents of the 

invention can be prepared by procedures known in the art using well known and 
readily available ingredients. For example, the agent can be formulated with 
common excipients, diluents, or carriers, and formed into tablets, capsules, 
suspensions, powders, and the like. Examples of excipients, diluents, and 

30 carriers that are suitable for such formulations include the following fillers and 
extenders such as starch, sugars, mannitol, and silicic derivatives; binding agents 
such as carboxymethyl cellulose, HPMC and other cellulose derivatives, 
alginates, gelatin, and polyvinylpyrrolidone; moisturizing agents such as 

28 



WO 99/60146 



PCT/US99/11197 



glycerol; disintegrating agents such as calcium carbonate and sodium 
bicarbonate; agents for retarding dissolution such as paraffin; resorption 
accelerators such as quaternary ammonium compounds; surface active agents 
such as cetyl alcohol, glycerol monostearate; adsorptive carriers such as kaolin 
5 and bentonite; and lubricants such as talc, calcium and magnesium stearate, and 
solid polyethyl glycols. 

For example, tablets or caplets containing the agents of the invention can 
include buffering agents such as calcium carbonate, magnesium oxide and 
magnesium carbonate. Caplets and tablets can also include inactive ingredients 

1 0 such as cellulose, pregelatinized starch, silicon dioxide, hydroxy propyl methyl 
cellulose, magnesium stearate, microcrystalline cellulose, starch, talc, titanium 
dioxide, benzoic acid, citric acid, com starch, mineral oil, polypropylene glycol, 
sodium phosphate, and zinc stearate, and the like. Hard or soft gelatin capsules 
containing an agent of the invention can contain inactive ingredients such as 

1 5 gelatin, microcrystalline cellulose, sodium lauryl sulfate, starch; talc, and 
titanium dioxide, and the like, as well as liquid vehicles such as polyethylene 
glycols (PEGs) and vegetable oil. Moreover, enteric coated caplets or tablets of 
an agent of the invention are designed to resist disintegration in the stomach and 
dissolve in the more neutral to alkaline environment of the duodenum. 

20 The therapeutic agents of the invention can also be formulated as elixirs 

or solutions for convenient oral administration or as solutions appropriate for 
parenteral administration, for instance by intramuscular, subcutaneous or 
intravenous routes. 

The pharmaceutical formulations of the therapeutic agents of the 

25 invention can also take the form of an aqueous or anhydrous solution or 
dispersion, or alternatively the form of an emulsion or suspension. 

Thus, the therapeutic agent may be formulated for parenteral 
administration (e.g., by injection, for example, bolus injection or continuous 
infusion) and may be presented in unit dose form in ampules, pre-filled syringes, 

30 small volume infusion containers or in multi-dose containers with an added 
preservative. The active ingredients may take such forms as suspensions, 
solutions, or emulsions in oily or aqueous vehicles, and may contain formulatory 
agents such as suspending, stabilizing and/or dispersing agents. Alternatively, 
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the active ingredients may be in powder form, obtained by aseptic isolation of 
sterile solid or by lyophilization from solution, for constitution with a suitable 
vehicle, e.g., sterile, pyrogen-free water, before use. 

These formulations can contain phannaceutically acceptable vehicles and 

5 adjuvants which are well known in the prior art. It is possible, for example, to 
prepare solutions using one or more organic solvent(s) that is/are acceptable 
from the physiological standpoint, chosen, in addition to water, from solvents 
such as acetone, ethanol, isopropyl alcohol, glycol ethers such as the products 
sold under the name "Dowanol", poly glycols and polyethylene glycols, C r C 4 

10 alkyl esters of short-chain acids, preferably ethyl or isopropyl lactate, fatty acid 
triglycerides such as the products marketed under the name "Miglyol", isopropyl 
myristate, animal, mineral and vegetable oils and polysiloxanes. 

The compositions according to the invention can also contain thickening 
agents such as cellulose and/or cellulose derivatives. They can also contain 

1 5 gums such as xanthan, guar or carbo gum or gum arabic, or alternatively 
polyethylene glycols, bentones and montmorillonites, and the like- 
It is possible to add, if necessary, an adjuvant chosen from antioxidants, 
surfactants, other preservatives, film-forming, keratolytic or comedolytic agents, 
perfumes and colorings. Also, other active ingredients may be added, whether 

20 for the conditions described or some other condition. 

For example, among antioxidants, t-butylhydroquinone, butylated 
hydroxyanisole, butylated hydroxytoluene and a-tocopherol and its derivatives 
may be mentioned. The galenical forms chiefly conditioned for topical 
application take the form of creams, milks, gels, dispersion or microemulsions, 

25 lotions thickened to a greater or lesser extent, impregnated pads, ointments or 
sticks, or alternatively the form of aerosol formulations in spray or foam form or 
alternatively in the form of a cake of soap. 

Additionally, the agents are well suited to formulation as sustained 
release dosage forms and the like. The formulations can be so constituted that 

30 they release the active ingredient only or preferably in a particular part of the 
intestinal or respiratory tract, possibly over a period of time. The coatings, 
envelopes, and protective matrices may be made, for example, from polymeric 
substances, such as polylactide-glycolates, liposomes, microemulsions, 
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microparticles, nanoparticles, or waxes. These coatings, envelopes, and 
protective matrices are useful to coat indwelling devices, e.g., stents, catheters, 
peritoneal dialysis tubing, and the like. 

The therapeutic agents of the invention can be delivered via patches for 
5 transdermal administration. See U.S. Patent No. 5,560,922 for examples of 
patches suitable for transdermal delivery of a therapeutic agent. Patches for 
transdermal delivery can comprise a backing layer and a polymer matrix which 
has dispersed or dissolved therein a therapeutic agent, along with one or more 
skin permeation enhancers. The backing layer can be made of any suitable mate- 

10 rial which is impermeable to the therapeutic agent. The backing layer serves as a 
protective cover for the matrix layer and provides also a support function. The 
backing can be formed so that it is essentially the same size layer as the polymer * 
matrix or it can be of larger dimension so that it can extend beyond the side of 
the polymer matrix or overlay the side or sides of the polymer matrix and then 

1 5 can extend outwardly in a manner that the surface of the extension of the backing 
layer can be the base for an adhesive means. Alternatively, the polymer matrix 
can contain, or be formulated of, an adhesive polymer, such as polyacrylate or 
acrylate/vinyl acetate copolymer. For long-term applications it might be desir- 
able to use microporous and/or breathable backing laminates, so hydration or 

20 maceration of the skin can be minimized. 

Examples of materials suitable for making the backing layer are films of 
high and low density polyethylene, polypropylene, polyurethane, 
polyvinylchloride, polyesters such as poly(ethylene phthalate), metal foils, metal 
foil laminates of such suitable polymer films, and the like. Preferably, the 

25 materials used for the backing layer are laminates of such polymer films with a 
metal foil such as aluminum foil. In such laminates, a polymer film of the 
laminate will usually be in contact with the adhesive polymer matrix. 

The backing layer can be any appropriate thickness which will provide 
the desired protective and support functions. A suitable thickness will be from 

30 about 1 0 to about 200 microns. 

Generally, those polymers used to form the biologically acceptable 
adhesive polymer layer are those capable of forming shaped bodies, thin walls or 
coatings through which therapeutic agents can pass at a controlled rate. Suitable 
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polymers are biologically and pharmaceutical^ compatible, nonallergenic and 
insoluble in and compatible with body fluids or tissues with which the device is 
contacted. The use of soluble polymers is to be avoided since dissolution or 
erosion of the matrix by skin moisture would affect the release rate of the 
5 therapeutic agents as well as the capability of the dosage unit to remain in place 
for convenience of removal. 

Exemplary materials for fabricating the adhesive polymer layer include 
polyethylene, polypropylene, polyurethane, ethylene/propylene copolymers, 
ethylene/ethylacrylate copolymers, ethylene/vinyl acetate copolymers, silicone 

1 0 elastomers, especially the medical-grade polydimethylsiloxanes, neoprene 
rubber, polyisobutylene, polyacrylates, chlorinated polyethylene, polyvinyl 
chloride, vinyl chloride-vinyl acetate copolymer, crosslinked polymethacrylate 
polymers (hydrogel), polyvinylidene chloride, polyethylene terephthalate), butyl 
rubber, epichlorohydrin rubbers, ethylenvinyl alcohol copolymers, ethylene- 

1 5 vinyloxyethanol copolymers; silicone copolymers, for example, polysiloxane- 
polycarbonate copolymers, polysiloxanepolyethylene oxide copolymers, 
polysiloxane-polymethacrylate copolymers, polysiloxane-alkylene copolymers 
(e.g., polysiloxane-ethylene copolymers), polysiloxane-alkylenesilane 
copolymers (e.g., polysiloxane-ethylenesilane copolymers), and the like; 

20 cellulose polymers, for example methyl or ethyl cellulose, hydroxy propyl 

methyl cellulose, and cellulose esters; polycarbonates; polytetrafluoroethylene; 
and the like. 

Preferably, a biologically acceptable adhesive polymer matrix should be 
selected from polymers with glass transition temperatures below room 

25 temperature. The polymer may, but need not necessarily, have a degree of 

crystallinity at room temperature. Cross-linking monomelic units or sites can be 
incorporated into such polymers. For example, cross-linking monomers can be 
incorporated into polyacrylate polymers, which provide sites for cross-linking 
the matrix after dispersing the therapeutic agent into the polymer. Known cross- 

30 linking monomers for polyacrylate polymers include polymethacrylic esters of 
polyols such as butylene diacrylate and dimethacrylate, trimethylol propane 
trimethacrylate and the like. Other monomers which provide such sites include 
allyl acrylate, allyl methacrylate, diallyl maleate and the like. 
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Preferably, a plasticizer and/or humectant is dispersed within the 
adhesive polymer matrix. Water-soluble polyols are generally suitable for this 
purpose. Incorporation of a humectant in the formulation allows the dosage unit 
to absorb moisture on the surface of skin which in turn helps to reduce skin 
5 irritation and to prevent the adhesive polymer layer of the delivery system from 
failing. 

Therapeutic agents released from a transdermal delivery system must be 
capable of penetrating each layer of skin. In order to increase the rate of 
permeation of a therapeutic agent, a transdermal drug delivery system must be 

1 0 able in particular to increase the permeability of the outermost layer of skin, the 
stratum corneum, which provides the most resistance to the penetration of 
molecules. The fabrication of patches for transdermal delivery of therapeutic 
agents is well known to the art. 

For administration to the upper (nasal) or lower respiratory tract by 

15 inhalation, the therapeutic agents of the invention are conveniently delivered 
from an insufflator, nebulizer or a pressurized pack or other convenient means of 
delivering an aerosol spray. Pressurized packs may comprise a suitable 
propellant such as dichlorodifluoromethane, trichlorofluoromethane, 
dichlorotetrafluoroethane, caibon dioxide or other suitable gas. In the case of a 

20 pressurized aerosol, the dosage unit may be determined by providing a valve to 
deliver a metered amount. 

Alternatively, for administration by inhalation or insufflation, the 
composition may take the form of a dry powder, for example, a powder mix of 
the therapeutic agent and a suitable powder base such as lactose or starch. The 

25 powder composition may be presented in unit dosage form in, for example, 
capsules or cartridges, or, e.g., gelatine or blister packs from which the powder 
may be administered with the aid of an inhalator, insufflator or a metered-dose 
inhaler. 

For intra-nasal administration, the therapeutic agent may be administered 
30 via nose drops, a liquid spray, such as via a plastic bottle atomizer or metered- 
dose inhaler. Typical of atomizers are the Mistometer (Wintrop) and the 
Medihaler (Riker). 
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The local delivery of the therapeutic agents of the invention can also be 
by a variety of techniques which administer the agent at or near the site of 
disease. Examples of site-specific or targeted local delivery techniques are not 
intended to be limiting but to be illustrative of the techniques available. 
5 Examples include local delivery catheters, such as an infusion or indwelling 
catheter, e.g., a needle infusion catheter, shunts and stents or other implantable 
devices, site specific carriers, direct injection, or direct applications. 

For topical administration, the therapeutic agents may be formulated as is 
known in the art for direct application to a target area. Conventional forms for 
10 this purpose include wound dressings, coated bandages or other polymer 
coverings, ointments, creams, lotions, pastes, jellies, sprays, and aerosols. 
Ointments and creams may, for example, be formulated with an aqueous or oily 
base with the addition of suitable thickening and/or gelling agents. Lotions may 
be formulated with an aqueous or oily base and will in general also contain one 
1 5 or more emulsifying agents, stabilizing agents, dispersing agents, suspending 
agents, thickening agents, or coloring agents. The active ingredients can also be 
delivered via iontophoresis, e.g., as disclosed in U.S. Patent Nos. 4,140,122; 
4,383,529; or 4,051,842. The percent by weight of a therapeutic agent of the 
invention present in a topical formulation will depend on various factors, but 
20 generally will be from 0.01% to 95% of the total weight of the formulation, and 
typically 0. 1 -25% by weight. 

Drops, such as eye drops or nose drops, may be formulated with an 
aqueous or non-aqueous base also comprising one or more dispersing agents, 
solubilizing agents or suspending agents. Liquid sprays are conveniently 
25 delivered from pressurized packs. Drops can be delivered via a simple eye 
dropper-capped bottle, or via a plastic bottle adapted to deliver liquid contents 
dropwise, via a specially shaped closure. 

The therapeutic agent may further be formulated for topical 
administration in the mouth or throat For example, the active ingredients may 
30 be formulated as a lozenge further comprising a flavored base, usually sucrose 
and acacia or tragacanth; pastilles comprising the composition in an inert base 
such as gelatin and glycerin or sucrose and acacia; and mouthwashes comprising 
the composition of the present invention in a suitable liquid carrier. 
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The formulations and compositions described herein may also contain 
other ingredients such as antimicrobial agents, or preservatives. Furthermore, 
the active ingredients may also be used in combination with other therapeutic 
agents, for example, bronchodilators. 

5 In particular, for delivery of a vector of the invention to a tissue such as 

muscle, any physical or biological method that will introduce the vector into the 
muscle tissue of a host animal can be employed. Vector means both a bare 
recombinant vector and vector DNA packaged into viral coat proteins, as is well 
known for AAV administration. Simply dissolving an AAV vector in phosphate 

1 0 buffered saline has been demonstrated to be sufficient to provide a vehicle useful 
for muscle tissue expression, and there are no known restrictions on the carriers 
or other components that can be coadministered with the vector (although 
compositions that degrade DNA should be avoided in the normal manner with 
vectors). Pharmaceutical compositions can be prepared as injectable 

1 5 formulations or as topical formulations to be delivered to the muscles by 

transdermal transport Numerous formulations for both intramuscular injection 
and transdermal transport have been previously developed and can be used in the 
practice of the invention. The vectors can be used with any pharmaceutical^ 
acceptable carrier for ease of administration and handling. 

20 For purposes of intramuscular injection, solutions in an adjuvant such as 

sesame or peanut oil or in aqueous propylene glycol can be employed, as well as 
sterile aqueous solutions. Such aqueous solutions can be buffered, if desired, 
and the liquid diluent first rendered isotonic with saline or glucose. Solutions of 
the AAV vector as a free acid (DNA contains acidic phosphate groups) or a 

25 pharmacologically acceptable salt can be prepared in water suitably mixed with a 
surfactant such as hydroxypropylcellulose. A dispersion of AAV viral particles 
can also be prepared in glycerol, liquid polyethylene glycols and mixtures 
thereof and in oils. Under ordinary conditions of storage and use, these 
preparations contain a preservative to prevent the growth of microorganisms. In 

30 this connection, the sterile aqueous media employed are all readily obtainable by 
standard techniques well-known to those skilled in the art 

The pharmaceutical forms suitable for injectable use include sterile 
aqueous solutions or dispersions and sterile powders for the extemporaneous 
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preparation of sterile injectable solutions or dispersions. In all cases the form 
must be sterile and must be fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria 
5 and fungi. The carrier can be a solvent or dispersion medium containing, for 
example, water, ethanol, polyol (for example, glycerol, propylene glycol, liquid 
polyethylene glycol and the like), suitable mixtures thereof, and vegetable oils. 
The proper fluidity can be maintained, for example, by the use of a coating such 
as lecithin, by the maintenance of the required particle size in the case of a 
1 0 dispersion and by the use of surfactants. The prevention of the action of 
microorganisms can be brought about by various antibacterial and antifungal 
agents, for example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal and 
the like. In many cases it will be preferable to include isotonic agents, for 
example, sugars or sodium chloride. Prolonged absoiption of the injectable 
1 5 compositions can be brought about by use of agents delaying absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions are prepared by incorporating the AAV vector 
in the required amount in the appropriate solvent with various of the other 
ingredients enumerated above, as required, followed by filtered sterilization. 
20 Generally, dispersions are prepared by incorporating the sterilized active 

ingredient into a sterile vehicle which contains the basic dispersion medium and 
the required other ingredients from those enumerated above. In the case of 
sterile powders for the preparation of sterile injectable solutions, the preferred 
methods of preparation are vacuum drying and the freeze drying technique which 
25 yield a powder of the active ingredient plus any additional desired ingredient 
from the previously sterile-filtered solution thereof 

For purposes of topical administration, dilute sterile, aqueous solutions 
(usually in about 0.1% to 5% concentration), otherwise similar to the above 
parenteral solutions, are prepared in containers suitable for incorporation into a 
30 transdermal patch, and can include known carriers, such as pharmaceutical grade 
dimethylsulfoxide (DMSO). 

The therapeutic compounds of this invention may be administered to a 
mammal alone or in combination with pharmaceutical^ acceptable carriers. As 
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noted above, the relative proportions of active ingredient and carrier are 
determined by the solubility and chemical nature of the compound, chosen route 
of administration and standard pharmaceutical practice. 

The dosage of the present therapeutic agents which will be most suitable 
5 for prophylaxis or treatment will vary with the form of administration, the 
particular compound chosen and the physiological characteristics of the 
particular patient under treatment. Generally, small dosages will be used 
initially and, if necessary, will be increased by small increments until the 
optimum effect under the circumstances is reached. Exemplary dosages are set 
1 0 out in the example below. 

Since AAV has been shown to have a broad host range (for pulmonary 
expression) and persists in muscle, the vectors of the invention may be employed 
to express a gene in any animal, and particularly in mammals, birds, fish, and 
reptiles, especially domesticated mammals and birds such as cattle, sheep, pigs, 
1 5 horses, dogs, cats, chickens, and turkeys. Both human and veterinary uses are 
particularly preferred. 

The gene being expressed can be either a DNA segment encoding a 
protein, with whatever control elements (e.g., promoters, operators) are desired 
by the user, or a non-coding DNA segment, the transcription of which produces 
20 all or part of some RNA-containing molecule (such as a transcription control 
element, +RNA, or anti-sense molecule). 

Muscle tissue is a very attractive target for in vivo gene delivery and gene 
therapy, because it is not a vital organ and is very easy to access. If a disease is 
caused by a defective gene product which is required to be produced and/or 
25 secreted, such as hemophilia, diabetes and Gaucher's disease, and the like, is 
muscle is a good candidate to supply the gene product if the appropriate gene can 
be effectively delivered into the cells. 

Different vectors, such as naked DNA, adenovirus and retrovirus, have 
been utilized to directly deliver various transgenes into muscle tissues. 
30 However, neither system can offer both high efficiency and long-term 

expression. For naked plasmid DNA directly delivered into muscle tissue, the 
efficiency is not high. There are only a few cells near the injection site that can 
maintain transgene expression. Furthermore, the plasmid DNA in the cells 
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remains as non-replicating episomes, i.e., in the unintegrated form. Therefore, it 
will be eventually lost. For adenovirus vector, it can infect the non-dividing 
cells, and therefore, can be directly delivered into the mature tissues such as 
muscle. However, the transgene delivered by adenovirus vectors are not useful 

5 to maintain long-term expression for the following reasons. First, since 
adenovirus vectors still retain most of the viral genes, they are not very safe. 
Moreover, the expression of those genes can cause the immune system to destroy 
the cells containing the vectors (see, for example, Yang et al. 1994, Proc. Natl. 
Acad. Sci. 91 :4407-441 1). Second, since adenovirus is not an integration virus, 

1 0 its DNA will eventually be diluted or degraded in the cells. Third, due to the 
immune response, adenovirus vector could not be repeatedly delivered. In the 
case of lifetime diseases, this will be a major limitation. For retrovirus vectors, 
although they can achieve stable integration into the host chromosomes, their use 
is very restricted because they can only infect dividing cells while a large 

1 5 majority of the muscle cells are non-dividing. 

Adeno-associated virus vectors have certain advantages over the above- 
mentioned vector systems. First, like adenovirus, AAV can efficiently infect 
non-dividing ceils. Second, all the AAV viral genes are eliminated in the vector. 
Since the viral-gene-expression-induced immune reaction is no longer a concern, 

20 AAV vectors are safer than Ad vectors. Thirds, AAV is an integration virus by 
nature, and integration into the host chromosome will stably maintain its 
transgene in the cells. Fourth, AAV is an extremely stable virus, which is 
resistant to many detergents, pH changes and heat (stable at 56°C for more than 
an hour). It can be lyophilized and redissolved without losing its activity. 

25 Therefore, it is a very promising delivery vehicle for gene therapy. 

The invention will be further described by, but is not limited to, the 
following examples. 

Example 1 

Materials and Methods 
30 Constnir.tinn 0 f r A ft V Shuttle Vector. 

A recombinant AAV shuttle vector (AV.GFP3ori) which contained a 
GFP transgene cassette, bacterial ampicillin resistance gene, and bacterial origin 
of replication, was generated from a cw-acting plasmid (pCisAV.GFP3ori). 
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Expression of the GFP gene was directed by the CMV promoter/enhancer and 
SV40 poly-adenylation sequences. pCisAV.GFP3ori was constructed with 
pSub201 derived ITR elements (Samulski et ah, 1987) and the intactness of ITR 
sequences was confirmed by restriction analysis with Smal and PvuII, and by 
5 sequencing. Recombinant AAV stocks were generated by co-transfection of 
pCisAV.GFP3ori and pRep/Cap together with co-infection of recombinant 
Ad.CMVlacZ in 293 cells (Duan et al., 1997). Following transfection of forty 
150 mm plates, cells were collected at 72 hours by centrifugation and 
resuspended in 12 ml of buffer (10 mM Tris pH 8.0). Virus was released from 

10 cells by three cycles of freeze/thawing and passaged through a 25 gauge needle 
six times. Cell lysates were then treated with 1 .3 mg/ml DNase I at 37°C for 30 
minutes and 1% deoxycholate (g/ml final) and 0.05% trypsin (g/ml final) at 
37 °C for 30 minutes. Samples were then placed on ice for 10 minutes and 
centrifuged to remove large particulate material at 3,000 rpm for 30 minutes. 

1 5 rAAV was purified by isopycnic density gradient centrifugation in CsCl 

(r=1.4) in a SW55 rotor for 72 hours at 35K. Peak fractions of AAV were 
combined and re-purified through two more rounds of CsCl centrifugation, 
followed by heating at 58 °C for 60 minutes to inactivate all contaminant helper 
adenovirus. Typically, this preparation gave approximate AAV titers of 10 12 

20 DNA molecules/ml and 2.5 x 10 8 GFP-expressing units/ml. Recombinant viral 
titers were assessed by slot blot and quantified against pCisAV.GFP3ori controls 
for DNA particles. Functional transducing units were quantified by GFP 
transgene expression in 293 cells. The absence of helper adenovirus was 
confirmed by histochemical staining of rAAV infected 293 cells for 

25 beta-galactosidase, and no recombinant adenovirus was found in 10 10 particles of 
purified rAAV stocks. The absence of significant wtAAV contamination was 
confirmed by immunocytochemical staining of rAAV/Ad co-infected 293 cells 
with anti-Rep antibodies. These studies, which had a sensitivity of 1 wtAAV in 
10 10 rAAV particles, demonstrated an absence of Rep staining as compared to 

30 pRep/Cap plasmid transfected controls. 
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Isolation and Structural Evalu ation of AAV Circular TnteTmediates From Hela 
Cells 

Hela cells were grown in 35 mm dishes in DMEM media supplemented 
with 10% fetal calf serum (FCS). Cells were infected in the presence of 2% FCS 
5 at 80% confluency with recombinant AV.GFP3ori (MOI=1000 particles/cell, 1 x 
10 9 total particles/plate) and Hirt DNAs isolated as described by Duan et al. 
(1997) at 6, 12, 24, 48, and 72 hours post-infection. In experiments analyzing 
the effects of adenovirus, plates were co-infected with Ad.CMVLacZ 
(MOI=5000 particles/cell) in the presence of 2%FCS/DMEM. Zero hour 

1 0 controls were generated by mixing 1 0 9 particles of AV.GFP3ori with cell lysates 
prior to Hirt DNA preparation. Hirt DNA isolated at each time point was used to 
transform E. coli SURE cells (Stratagene, La Jolla, CA.). Typically, 1/10 of the 
Hirt DNA preparation was used to transform 40 ml of competent bacteria by 
electroporation. The resultant total number of bacterial colonies was quantified 

1 5 for each time point and the structure of circular intermediates was evaluated for 
greater than 20 plasmid clones for each time point from two independent 
experiments. Structural determinations were based on restriction enzyme 
analysis using PstI, SphI, Asel single and double digests together with Southern 
blotting against GFP, staffer, and ITR probes. 

20 Evaluation of F r 2a and GFP gene ex pression in Hela cells. 

E2a gene expression was evaluated by immuno fluorescent staining of 
Hela cells superinfected with El-deleted Ad.CMVlacZ (MOI= 0, 500, 5000 
particles/cell). Briefly, cells were fixed in methanol at -20°C for 10 minutes 
followed by air drying. Cells were then incubated at room temperature with 

25 hybridoma supernatant against Ad5 72kd DBP (Reich et al., 1983), followed by 
goat anti-mouse-FITC antibody (5 mg/ml) for 30 minutes at room temperature. 
In studies evaluating augmentation of AAV GFP transgene expression by 
adenovirus, Hela cells were harvested at 24 or 72 hours post-infection by 
trypsinization, resuspended in 2%FCS/PBS and evaluated by FACS analyses. 

30 Thresholds were set using uninfected controls and the percentage and/or the 

average relative fluorescent intensity was determined by sorting greater than 1 0 5 
cells per experiment condition. 
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Sequence Analysi s of AAV Circular Intermediates. 

Sequence analysis of the ITR array within circular intermediates was 
performed using primers ELI 18 (5 ' -CGGGGGTCGTTGGGCGGTC A-3 ' ; SEQ 
ID NO:l) and EL230 (5 '-GGGCGGAGCCTATGGAAAA-3 '; SEQ ID NO:2) 
5 which are nested to 5' and 3' ITR sequences, respectively. Both circular and 
linearized (with Smal which cuts within ITR sequences) plasmids were 
sequenced. 
Results 

Construction of rAAV Shuttle V ector and Isolation of Circular Intermediates. 

1 0 To circumvent the inability to retrieve pre-integration intermediates or as 

stable episomal forms resistant to nuclease digestion, an alternative strategy was 
developed to "trap" circular intermediates using a recombinant AAV shuttle 
vector. Recombinant AV.GFP3ori virus (Figure IB) was generated from a 
cw-acting plasmid (pCisAV.GFP3ori, Figure 1 A) by co-transfection in 293 cells 

1 5 with trans-acting plasmids encoding Rep and Cap viral genes. This viral vector 
(AV.GFP3ori) encoded the green fluorescent protein (GFP) reporter gene, a 
bacterial origin of replication (ori), and the bacterial ampicillin-resistance gene. 
Ori and ampicillin-resistance sequences encoded in this virus allow for the 
rescue of circular AAV genomes formed during the transduction process. 

20 To test this strategy, Hela cells were infected with AV.GFP3ori 

(MOI=1000 particles/cell) and the abundance of circular intermediates was 
evaluated following transformation of low molecular weight cellular Hirt DNA 
into £. coli SURE cells. The presence of circular intermediates was inferred by 
retrievable arnpicillin-resistant bacterial colonies. Structural features of circular 

25 intermediates were determined by restriction enzyme analysis and Southern 
blotting with various regions of the provirus, including GFP, Stuffer, and ITR 
sequences. 

The predominant circular form isolated after transduction of Hela cells 
with AV.GFP3ori consisted of 4.7 kb monomer-sized molecules (Figure 1C). 
30 SphI digestions of these circular intermediates yielded characteristic 300 bp 
bands which hybridized to an ITR probe on Southern blots (Figure 2A). PstI, 
SphI, Asel single and double digests together with Southern blot analysis using 
GFP, Stuffer (data not shown), and ITR (Figure 2A) probes confirmed the 
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structure of the circular intermediates as head-to-tail monomer genomes (Figure 
1C). In particular, PstI digests together with ITR Southern blots distinguish 
these head-to-tail circular intermediates from head-to-head or tail-to-tail circular 
dimers. Similar results obtained with studies on AV.GFP3ori infected 293 cells 
5 and primary fibroblasts have confirmed that monomer head-to-tail circular 
intermediates were also the most abundant form in these cell types. 

Because the predicted molecular weight of an intact head-to-tail ITR 
SphI fragment would be approximately 360 bp, an anomalous migration in 
agarose gels might be due to the high secondary structure of inverted repeats 

10 within ITRs. To this end, the head-to-tail orientation of the ITRs, as predicted 
by Southern blot analysis, was confirmed using several sequencing strategies. 
First, the SphI ITR hybridizing fragment of a circular intermediates was 
subcloned into a secondary plasmid vector and sequenced with primers outside 
the ITR cloned sequences. These findings confirmed the head-to-tail orientation 

1 5 of ITRs. Additionally, sequence was obtained directly from six monomer 
circular intermediate clones using primers internal to both the 5' and 3' ITRs 
(Figure 2C). In these studies, circular intermediates were digested with Smal 
and the linear 4.6 kb plasmid was gel isolated prior to sequencing. Smal 
digestion (which relaxed the secondary structure of ITRs) was necessary to 

20 obtain sequence information within the ITRs. The sequencing results presented 
in Figure 2C confirmed the orientation of head-to-tail ITR arrays in these 
intermediates. 

Interestingly, sequencing also revealed several consistent base pair (bp) 
changes in four of the six clones analyzed (Figure 2C). These four clones (p79, 

25 p81, p87, and p88) had consistent two bp changes within the D-sequence [G->A 
(122bp) and A->G (125bp)], which always occurred together with the bp 
alterations in the p5 promoter [A->G (1 14bp) and A->C (1 15bp)]. No other 
consistent bp changes were noted except for two clones (p79 and p88) which 
demonstrated mutations just outside the 3'ITR D-sequence [T->G (381bp) and 

30 T->C (383bp)]. 

Although head-to-tail circular intermediates were the most abundant 
forms present in Hirt DNA from rAAV infected Hela cells, several less frequent 
structures were also detected. These included monomer circularized AAV 
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genomes with one (pi 90) and three ITRs (p345) arranged in a head-to-tail 
fashion as well as several clones with an unknown structure lacking complete 
ITRs (p340) (Figure 2A). Such diversity within the ITR amy may represent 
homologous recombination in vivo or in bacteria during amplification. However, 
5 previous studies demonstrating similar variations in ITR sequences of 

head-to-tail integrated genomes, suggest that such changes in the length of the 
ITR array may occur in vivo (Duan et aL, 1997) Additionally, less frequent 
head-to-tail circularized multimer forms were predicted based on the variation in 
migration patterns of uncut plasmids which gave identical restriction patterns. 
1 0 Results shown in Figure 2B confirmed the existence of monomer and dimer 
head-to-tail circular intermediates using partial digestion with an enzyme which 
cuts once in the AAV genome ( Asel). Cumulative analysis of greater than 200 
independently isolated circular intermediates from Hela cells demonstrated that 
head-to-tail circular AAV genomes occurred in greatest abundance as monomers 
1 5 (92%) and less frequently as multimers of greater than one genome (8%). 

To establish that head-to-tail circular intermediates were formed in vivo 
and not by non-specific bacterial recombination of linear AAV genomes present 
in the Hirt DNA, a set of reconstitution experiments was performed by which the 
same number of rAAV particles used for infection experiments were spiked into 
20 Hela cell lysates prior to Hirt preparations. In these studies, background 
bacterial amplification of Hirt DNA spiked with linear rAAV genomes was 
negligible (Figure 3D) and of the few isolated colonies obtained from these 
controls, none had a predicted head-to-tail structure as assessed by Southern blot 
restriction enzyme analysis (Figure 3E). Additionally, reconstitution 
25 experiments transforming bacterial with linearized dsDNA AAV genomes did 
not give rise to significant levels of replication competent plasmids or the 
characteristic head-to-tail structure associated with AAV circular intermediates. 
These findings confirm that circular intermediates do not likely arise from 
non-specific recombination or ligation events with either ssDNA or dsDNA 
30 linear AAV genomes in bacteria. Additional control experiments, demonstrating 
the lack of stuffer hybridizing sequences in AAV circular intermediates by 
Southern blotting, also confirm that these structures do not arise from 
contamination of viral stocks with pCisAV.GFP3ori plasmid. 
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Thft formation of hea ^-to-tail r.irr.nlar AAV intermediates is ailPmented by 
suppHnfertion with R1 -deleted adenovirus. 

Many aspects of the wtAAV growth cycle are affected by helper 
adenovirus, including AAV DNA replication, transcription, splicing, translation, 
5 and virion assembly. Such studies have provided concrete evidence that a subset 
of Ad early gene products provide helper functions for the wtAAV lytic cycle, 
including: Ela, Elb, E2a, E4 ORF6 and VA1 RNA (Muzyczka, 1992). In this 
regard, one of the most critical factors which is required for AAV replication is 
the 34 kD E4 protein (ORF6). Recent observations on the helper function of Ad 
10 in rAAV transduction have also demonstrated that Ad E4 ORF6 is essential for 
the augmentation of rAAV transgene expression seen with adenovirus 
co-infection (Ferrari et al, 1996; Fisher et aL, 1996). According to these reports, 
the rate-limiting step enhanced by these adenoviral proteins is the conversion of 
single stranded AAV genomes to double stranded forms. 
1 5 Studies evaluating the kinetics of rAAV circular intermediate formation 

demonstrated a time-dependent increase in abundance which peaked at 24 hours 
post-infection in Hela cells and coincided with the onset of GFP transgene 
expression (Figure 3). To better understand the cellular mechanisms associated 
with AAV circular intermediate formation, the effects of adenoviral co-infection 
20 on this process were evaluated. The extent of transgene expression and circular 
intermediate formation in AV.GFP3ori infected Hela cells with or without 
co-infection with El -deleted recombinant adenovirus was compared. 

Although El -deleted adenoviruses are severely handicapped in their 
ability to synthesize viral gene products, at high MOIs of >5000 significant E2a 
25 protein expression was noted (Figure 3A). As an indicator of transgene 

expression, the abundance and average relative intensity of GFP positive cells 
was determined against mock infected controls by fluorescent microscopy 
(Figure 3B) and FACS analysis (Figure 3C) at 72 hours post-infection. In accord 
with previous reports demonstrating augmentation in rAAV transgene expression 
30 by adenovirus (Ferrari et al, 1996; Fisher et al., 1996), the extent of GFP 

transgene expression was dramatically increased at doses of adenovirus which 
led to viral gene expression (MOI>5000; Figures 3A-C). Additionally, 
persistence of rAAV transgene expression was also augmented by co-infection 
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with El-deleted adenovirus, as determined by GFP-expressing colony formation 
following serial passages (Figure 3C). 

If circular intermediates represent a molecular form of rAAV important 
for efficient and/or persistent transgene expression, augmentation of rAAV 
5 transgene expression by adenovirus might also modulate circular intermediate 
formation. In these studies, the abundance and time course of AAV circular 
intermediate formation was evaluated following superinfection with 
Ad.CMVLacZ. Results from these experiments are shown in Figure 3D, which 
represents the total number of bacterial colonies (per 35 mm plate) obtained 

10 following transformation of £. coli with Hirt DNA isolated from Hela cells 
infected with AV.GFP3ori (1000 DNA particles/cell) with or without 
co-infection with Ad.CMVlacZ (5,000 particles/cell). An MOI of 5000 Ad 
particles/cell was chosen for these experiments since this level of adenovirus led 
to minimal cytopathic effect (CPE) with high levels of E2a expression. 

1 5 These studies demonstrated a nearly 2-fold augmentation by 

Ad.CMVLacZ in the total abundance of AAV rescued plasmid intermediates in 
E. coli (Figure 3D). Southern blot restriction enzyme analysis demonstrated that 
the predominant forms in both the presence and absence of adenovirus were 
head-to-tail monomer circular intermediates containing the diagnostic 300 bp 

20 ITR fragment following SphI digestion (Figure 3E). Additionally, results 
demonstrated that adenovirus co-infection led to an earlier time of onset and 
increased stability of AAV head-to-tail monomer circular intermediates (Figures 
3E and F). For example, at 6 hours post-infection, head-to-tail circular 
intermediates were only present in Hela cells co-infected with adenovirus. 

25 Furthermore, a decline in the percentage of head-to-tail circular intermediate 

clones was seen at 48-72 hours post-AAV infection in the absence of adenovirus. 
In contrast, this decline was significantly blunted by the presence of helper 
adenovirus (Figure 3F). Based on these findings, it was concluded that certain 
adenoviral proteins produced by superinfection with El -deleted adenovirus were 

30 capable of modulating circular intermediates formation and stability during 
rAAV transduction. 
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Discussion 

In the present study, it was shown that circularization of linear AAV 
genomes occurs during rAAV transduction. Circularization appears to 
predominately occur as head-to-tail monomer genomes. However, the existence 
5 of less abundant circular multimer forms suggests that recombinational events 
subsequent to the initial infection may drive concatamerization of circular 
genomes. The diversity in the length of ITR arrays found within circular 
intermediates (i.e., 1-3 ITRs) also supports the notion that these forms may be 
highly recombinagenic. Of mechanistic interest in the formation of circular 

10 intermediates is the uniformity of mutations observed in the D-sequences and 
nearby p5 promoter region and the confinement of these mutations to the 
5 '-ITRs. Although the etiology of these base pair changes is unknown, their 
uniformity suggests that they may have a direct role in the formation of circular 
intermediates and in increased stability. Recent findings, which suggest that an 

1 5 endogenous host single strand D-sequence binding protein is important in rAAV 
transduction, lend support to the potential involvement of this sequence in 
circular intermediate formation (Wang et al., 1997; Qing et aL, 1998). 
Furthermore, it remains to be determined whether the in vivo formation of AAV 
circular intermediates occurs through the circularization of single or double 

20 stranded AAV genomes. 

By analogy, retroviral transduction intermediates have striking 
similarities to the current findings with AAV. Three DNA forms have been 
isolated following retroviral infection, including linear DNA with long terminal 
repeats (LTRs) at both ends, circular DNA with one LTR, and circular DNA 

25 with multiple LTRs (Panganiban, 1985). Although it is disputed which of these 
forms are the direct precursor to integration, the existence of circular retroviral 
genomes which also have similar repeat regions at the ends of their genomes 
suggests the potential for common mechanisms with the formation of AAV 
circular intermediates. These AAV circular intermediates could act as 

30 integration precursors and/or stable episomal genomes. 

The head-to-tail ITR structures found in AAV circular intermediates are 
most characteristic of latent integrated AAV genomes. In contrast, lytic phases 
of AAV growth are typically associated with head-to-head and tail-to-tail 
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replication form genomes. Hence, it is likely that circular intermediates represent 
a latent aspect of the AAV life cycle. The finding that co-infection with 
adenovirus leads to increased abundance and stability of AAV circular 
intermediates suggests a novel link between adenoviral helper functions and 

5 latent infection of AAV . 

Aspects of inverted head-to-tail ITRs, which include palindromic 
hairpins similar in structure to "Holliday-like" junctions, might impart 
recombinagenic activity which aids in viral integration. Such Holliday junctions 
have been shown to play critical roles in directing homologous recombination in 

1 0 bacteria through the processing of recombination intermediates by RuvABC 
proteins (West, 1997; Lee et al., 1998). Interestingly, a mammalian 
endonuclease, analogous to bacterial RuvC resolvase, has also been isolated 
from cell lines (Hyde et al, 1994). Despite the theoretical considerations which 
might suggest that circular AAV genomes have characteristics of preintegration 

1 5 intermediates, a study with recombinant retrovirus has demonstrated that 
palindromic LTR-LTR junctions of MMLV are not efficient substrates for 
proviral integration (Lobel et al., 1989). Nonetheless, circular AAV genomes 
have been previously proposed as integration intermediates based on proviral 
structure (Linden et al., 1996). 

20 Example 2 

Methods 

Production of rA AV Shuttle Vector. 

The cw-acting plasmid (pCisAV.GFP3ori) used for rAAV production 
was generated by subcloning the Bspl201/Not I fragment (743 bp) of the GFP 

25 transgene from pEGFP-1 (Clontech) between the CMV enhancer/promoter and 
SV40polyA by blunt-end ligation. A 2.5 kb cassette containing beta-lactamase 
and bacterial replication origin from pUC19 was blunt ligated down-stream of 
GFP reporter cassette. The ITR elements were derived from pSub20 1.2 The 
entire plasmid contains a 4.7 kb AAV component flanked by a 2 kb stuffer 

30 sequence. The integrity of ITR sequences was confirmed by restriction analysis 
with Smal and PvuII, and by direct sequencing using a modified di-deoxy 
procedure which allowed for complete sequence through both 5' and 3' ITRs. 
Recombinant AAV stocks were generated by co-transfection of 
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pCi$AV.GFP3ori and pRep/Cap together with co-infection of recombinant 
Ad.CMVlacZ in 293 cells. The rAV.GFP3ori virus was subsequently purified 
through 3 rounds of CsCl banding as described in Duan et ah, 1997. The typical 
yields from this viral preparation were 1012 DNA molecules/ml. 
5 DNA titers were determined by viral DNA slot blot hybridization against 

GFP 32 P-labeled probe with copy number plasmid standards. The absence of 
helper adenovirus was confirmed by histochemical staining of rAAV infected 
293 cells for beta-galactosidase, and no recombinant adenovirus was found in 
10 10 particles of purified rAAV stocks. The absence of significant wtAAV 
10 contamination was confirmed by immunocytochemical staining of rAAV/Ad 
co-infected 293 cells with anti-Rep antibodies. Transfection with pRep/Cap was 
used to confirm the specificity of immunocytochemical staining. No 
immunoreactive Rep staining was observed in 293 cells infected with 10 10 rAAV 
particles. 

15 Isolation of AAV Circu lar Intermediates From Muscle. 

The tibialis anterior muscle of 4-5 week old C57BL/6 mice were infected 
with AV.GFP3ori (3 X 10 10 particles) in Hepes buffered saline (30 fxl). GFP 
expression was analyzed by direct immunofluorescence of freshly excised tissues 
and/or in formalin-fixed cryopreserved tissue sections in four independently 

20 injected muscles harvested at 0, 5, 10, 16, 22 and 80 days post-infection. Tissue 
sections were counter-stained with propidium iodide to identify nuclear DNA. 
Hirt DNA (Hirt, 1967) (20 ml per muscle sample) was isolated from at least 
three independent muscle specimen for each time point and used to transform £. 
coli SURE cells using 3 ml of Hirt with 40 ml of electrocompetent bacterial 

25 (approximately 1 x 10 9 cfu/ug DNA, Strategene Inc.). The resultant total number 
of bacterial colonies was quantified for each time point and the abundance of 
head-to-tail circular intermediates was evaluated for each time point (> 20 
bacterial clones analyzed) by PstI, Asel, SphI, and Pstl/Asel digestion, and 
confirmed by Southern blot analysis using ITR, GFP and stuffer probes. The 

30 head-to-tail configuration in typical clones were also confirmed by dideoxy 

sequencing using primers ELI 18 (S'-CGGGGGTCGTTGGGCGGTCAO'; SEQ 
ID NO:l) and EL230 (5 '-GGGCGGAGCCTATGG AAAA-3 ' ; SEQ ID NO:2) 
which are nested to 5' and 3' ITR sequences, respectively. Zero hour controls 
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were generated by mixing 3 x 10 10 particles of AV.GFP3ori with control 
uninfected muscle lysates prior to Hirt DNA preparation. As described in Table 
1 , a number of additional controls for were performed to rule out non-specific 
recombination of linear AAV genomes in bacteria as a source for isolated 
5 circular intermediates. 



Table 1. Control Experiments for Rescue of Circular Intermediates in Bacteria 



10 


Type of Input 
DNA 


Source of DNA 


Number of 
Molecules 


Number of 
Amp Resistant 
Bacterial 
Colonies 


Presence of 
Head-to-Tail 

Circular 
Intermediates* 




Purified rAW 


Hirt from Infected 
Muscle (22 day) 


3 x 10 !0 


approximately 
5x10 s 


Yes 




Purified rAAV 


Virus 
reconstituted into 
Uninfected 
Muscle Hirf 


3 x 10 10 


0 


No 


15 


Linear ssDNA 
Encompassing 
rAAV 
Genome 6 


Isolated from 
Purified Virus 


3 x 10 10 


2 


No 


20 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 


Isolated from 
proviral plasmid 
(HindIII/PvuII) c 


3 x 10 10 


3 


No 


25 


Linear dsDNA 
Encompassing 
Entire rAAV 
Genome 
+ ligase d 


Isolated from 
proviral plasmid 
(Hindlll/PvuII) 


3xl0 10 


>6xl0 3 


Yes 



a Purified virus was reconstituted into muscle homogenates prior to preparation 
of Hirt DNA 



30 b Viral DNA predominantly contained single stranded genomes as evident by 
Southern blot analysis against with ITR probe. However, small amount of 
dsDNA AAV genomes also existed and are likely due to reannealing of single 
stranded genomes during preparation. Purified viral DNA concentrations were 
determined by OD 260 and 75 ng representing approximately 3 x 10 10 viral 

35 genomes were used for transformation of bacteria. 

c Hindlll/PvuII digestion was used to remove the entire rAAV genome from 
pcisAV.GFP3ori. Hindlll and PvuII leave 10 and 0 bps of flanking sequence 
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outside the 5 ; and 3' ITRs, respectively. The linear dsDNA fragment (4.7 kb) 
was gel isolated following blunting with T4 DNA polymerase and the DNA 
concentration determined by OD 260 . One hundred and fifty ng of linear fragment 
representing approximately 3 x 10 10 viral genomes were used for transformation 
5 of bacteria. 

d Linear dsDNA viral genomes (HindHI/PvuII blunted fragment) were treated 
with T4 DNA ligase prior to transformation of bacteria. 

10 e The presence of head-to-tail circular AAV intermediates were confirmed by 
restriction enzyme digestion (Asel, PstI, and SphI) and Southern blotting against 
ITR probe. 

Fractionation of muscle Hirt DNA preparations, 

1 5 Preparative-scale fractionation of the muscle Hirt DNA was performed 

by 1% agarose gel electrophoresis using the Bio-Rad Mini Prep Cell (Catalog 
#170-2908). A 4.5 ml (10.5 cm) tubular gel containing 1 x TBE buffer was 
poured according to manufacturer's specification. A total of 20 ml Hirt 
preparation from one entire muscle sample was loaded on top of the gel. 

20 Electrophoresis was carried out at a constant current of 10 mA over a period of 
5 hours. Sample eluent was drawn from the preparative gel apparatus by a 
peristaltic pump at a rate of 100 ml/min and eluted into a fraction collector at 
250 ml/fraction. The collected DNA was subsequently concentrated by standard 
ethanol precipitation and used to transform SURE bacterial cells by 

25 electroporation as described above. 

In vitrn Persistency of AAV Circular Intermediates. 

Transgene expression and persistence of AAV circular intermediate 
plasmid clones were evaluated following transient transfection in Hela and 293 
cells. Subconfluent monolayers of Hela cells in 24-well dishes were transfected 

30 with 0.5 mg of either AAV circular intermediates (p81 or p87) or pCMVGFP 
using Lipofectamine (Gibco BRL Inc.). The cultures were then incubated for 5 
hours in serum free DMEM followed by incubation in DMEM supplemented 
with 10% fetal bovine serum. All plasmid DNA samples used for transfections 
were spiked with pRS VlacZ (0.5 mg) as an internal control for transfection 

35 efficiency. At 48 hours post-transfection, cells were passaged at a 1 : 1 0 dilution 
and allowed to grow to confluency (day 5), at which time GFP clones were 
quantified for size and abundance using direct fluorescent microscopy. The 
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percent of beta-galactosidase-expressing cells was also quantified at this time 
point by X-gal staining. At 5 days, cells were passaged an additional time (1:15 
dilution) GFP clones were quantified again at day 10. The persistence of 
plasmid DNA at passage-5, 7, and 10 days post-transfection was evaluated by 
5 Southern blot analysis of total cellular DNA using 32 P-labeied GFP probes. To 
determine whether the head-to-tail ITR array within circular intermediates was 
responsible for increases in the persistence of GFP expression, the head-to-tail 
ITR DNA element was subcloned into the pGL3 luciferase plasmid to generate 
pGL3(ITR). The head-to-tail ITR DNA element was isolated from a monomer 

1 0 circular intermediate (p8 1 ) by Aatll and Haell double digestion and 

subsequently inserted into the Sail site of pGL3 (Proraega) by blunt ligation. 
The resultant plasmid pGL3(ITR) contains the luciferase reporter and 
head-to-tail ITR element 3' to the polyA site. The integrity of the ITR DNA 
element within this plasmid was confirmed by sequencing. The persistence of 

1 5 transgene expression from pGL3(ITR) was compared to that of pGL3 by 
luciferase assays on transiently transfected Hela cells as described above and 
analyzed at 10 days (passage-2). Transfection efficiencies were normalized 
using a dual renilla luciferase reporter vector (pRLS V40, Promega). 
Results 

20 AAV Circular Intermediate* Represent Stable Fpisomal Forms of Viral DNA 
AssnHateri with f ring-term Persistence of Transgene Expression in Muscle. 

To evaluate the molecular characteristics of rAAV genomes in muscle, a 
rAAV shuttle viral vector (AV.GFP3ori) was utilized which harbors an 
ampicillin resistance gene, bacterial origin of replication, and GFP reporter gene 

25 (Figure 1 A). This recombinant virus was used to evaluate the presence of 

circular intermediates by bacterial rescue of replication competent plasmids. In 
these studies, delivery of AV.GFP3ori (3 x 1010 particles) to the tibialis muscle 
of mice led to GFP transgene expression which peaked at 22 days and remained 
stable for at least 80 days (Figure 4A). These results confirmed previous 

30 successes in rAAV mediated gene transfer to muscle (Kessler et al., 1996; 
Herzog et al., 1997; Xiao et al., 1996; Clark et al., 1997; Fisher et al., 1997). 
The formation of circular intermediates was evaluated by £. coli transformation 
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of Hirt DNA harvested from muscle at 0, 5, 10, 16, 22, and 80 days 
post-infection with AV.GFP3ori. 

In these muscle samples, circular intermediates were found to have a 
characteristic head-to-tail structure with 1-2 ITR repeats. The most abundant 

5 form included two inverted ITRs within a circularized genome (Figure 4B, clone 
pi 7). This figure also depicts a less frequent form (< 5%) of circular 
intermediates observed, p439, with undetermined structure. When this type of 
replication competent plasmid was seen, it was not included in the 
quantification of head-to-tail circular intermediates since its structure could not 

10 be conclusively determined. The total abundance of muscle Hirt derived 
head-to-tail circular intermediates (with 1-2 ITRs) demonstrated a 
time-dependent increase that peaked with transgene expression at 22 days and 
slightly decreased by day 80 (Figure 5A). Increased diversity in the length of 
ITR arrays within circular intermediates was seen at longer time points. For 

15 example, Figure 5B demonstrates several isolated circular intermediates with 1-3 
ITRs isolated from 80 days muscle Hirt samples. This is in contrast to the more 
uniform structure of circular intermediates with two ITRs in a head-to-tail 
conformation at 5-22 days post-infection. 

To evaluate the potential for artifactual rescue of linear rAAV genomes 

20 by recombination in bacteria, several control experiments were performed. First, 
uninfected control muscle Hirt preparations, spiked with an equal amount of 
rAAV virus used for in vivo infection of muscles, failed to give rise to 
replicating plasmids following transformation of E.colt Second, when a blunted 
linear double stranded HindlH/PvuII fragment isolated from pcisAV.GFP3ori 

25 (encompassing the entire rAAV genome) was used to transform bacteria, no 
ampicillin resistant bacterial colonies were obtained. The addition of T4 ligase 
to this fragment, however, led to significant numbers of bacterial colonies. 
Third, when purified single stranded rAAV DNA was used for transformation, 
no bacterial colonies were obtained. As summarized in Table 1, these results 

30 confirm that in the absence of productive infection, rAAV genomes themselves 
are incapable of recombining into replication competent plasmids in bacteria. 
Hence, in vivo circularization of rAAV genomes is a prerequisite for rescuing 
autonomously replicating plasmids in £. coli with this shuttle vector. 
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Molecular weight of rirrmlar in t ermediates snppest a conversion from monomer 

to multimer forms over time. 

To further characterize the circular intermediates isolated from muscles, 
Hirt samples from 22 days and 80 days post-infected muscles were size 

5 fractionated by continuous-flow gel electrophoresis (BioRad). As shown in 
Figure 6 t the majority of circular intermediates at 22 days post-infection size 
fractionated at a molecular weight of less than 3 Kbp. Very few clones were 
isolated from fractions between 3 to 5 kb and no clones were obtained from 
fractions larger than 5 kb at this time point. Furthermore, this size fractionated 

1 0 molecular weight of in vivo Hirt derived circular intermediates at 22 day time 
points correlated with that of head-to-tail monomer undigested circular 
intermediate plasmids rescued in bacteria from this same time point 
(approximately 2.5 kb). These data suggest that at early time points 
post-infection in muscle, the predominant form of circular intermediates likely 

1 5 occurs as monomer genomes. The lower mobility of this fraction as compared to 
replication form monomer (Rfin=4.7 kb) and dimer (Rfd=9.4 kb) genomes 
provides indirect evidence that these forms are not responsible for rescued 
plasmids in these Hirt samples. Interestingly, when 80 day muscle Hirt samples 
were size fractionated, more clones were retrieved from higher molecular weight 

20 fractions ranging from 3-12 kb (Figure 6). This shift in the molecular weight of 
circular intermediates indicates the potential for recombination between 
monomer forms in the generation of large circular multimer genomes. Such 
concatamerization has been previously observed in muscle and has traditionally . 
been hypothesized to involve linear integrated forms of the AAV genome 

25 (Herzog et al., 1997; Xiao et al., 1996; Clark et al., 1997; Fisher et aL, 1997). 
This data sheds new light on the molecular characteristics of these persistent 
AAV genomes and suggests that they are in fact circular and episomal. Based on 
yields of retrievable circular plasmids reconstituted in Hirt DNA, the efficiency 
of bacterial transformation, and the initial innoculum of virus, we estimate that 

30 approximately 1 in 400 viral DNA particles circularize following infection in 
muscle (Table 2). 
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Bacterial 
Transformation 


Starting Number 
of Plasmid or 
AAV Genomes 


Actual Number 
of Amp r cfu 


Adjusted Yield 


Hirt DNA from rAAV 
Infected Muscle a 


3 x 10 10 molecules 


5x10* cfu 


5xl0 5 cfu c 


Hirt DNA + 230 ng 
LacZPlasmid b ' c 


3 x 10 10 molecules 


2xl0 6 cfii d 


2 x 10 8 cfu 


230 ng LacZ Plasmid c 


3 x 10 10 molecules 


2xl0 8 cfu 





a The actual amount of Hirt used for transformation was 3/20 the entire Hirt 
DNA. The numbers have been adjusted to reflect viral innoculum and yields for 
the entire muscle. 

15 

b Plasmid DNA was spiked into mock infected muscle homogenates prior to 
isolation of Hirt DNA. This reconstituted Hirt DNA was then used for 
transformation of bacteria. 

20 c The actual microgram amounts of plasmid used in reconstitution experiments 
was 10 ng. The numbers have been adjusted for comparison to normalize the 
number of plasmids genomes to that used in AAV experiments. Control LacZ 
plasmid was approximately 7000 bp with a molecular weight of 4.6 x 10 6 
g/mole. 

25 

d The average of several experiments indicates an approximate 100-fold 
reduction in the number of cfu recovered from bacterial transformations with 
DNA isolated from Hirt extract spiked with plasmids as compared to 
transformation with an equivalent amount of plasmid DNA alone. 

30 

c Adjusted yield indicate approximately 1 in 400 AAV genomes circularize in 

vivo. 

Given the fact that not all rAAV particles likely contain functional DNA 
35 molecules and intermediates may integrate, these calculations may represent an 
underestimation. 

AAV Circular Intermediates D emonstrate Increased Persistence, as Plasmid 

Based Vectors. 

40 Based on the finding that circular AAV intermediates were associated 

with long term persistence of transgene expression in muscle, rAAV circular 
head-to-tail intermediates may be molecular structures of the AAV genome 
associated with the latent life cycle and increased episomal stability. Several 
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aspects of the structure of AAV circular intermediates may account for their 
increased stability in vivo. First, circularization of AAV genomes may create a 
nuclease resistant conformation. Secondly, since the only viral sequences 
contained within circular intermediates are the head-to-tail ITR array, these 

5 sequences might bind cellular factors capable of stabilizing these structures in 
vivo. Several studies have demonstrated increased persistence of transgene 
expression with plasmid DNA encoding viral ITRs (Philip et aL, 1994; Vieweg 
et al., 1995). The results described above provide a functional explanation for 
the increased persistence through the association with circular intermediate 

1 0 formation as part of the AAV life cycle. 

To more closely evaluate the persistence of AAV head-to-tail circular 
intermediates, several in vitro experiments were performed by transfecting these 
intermediates into Hela cells and assessing the stability of plasmid DNA and 
transgene expression by GFP clonal expansion. Results from Hela cell 

1 5 transfection experiments demonstrated that two monomer head-to-tail circular 
intermediates (p8 1 and p87) studied gave rise to a 10-fold higher number of five 
and ten day transgene-expressing clones, as compared to a control pCMVGFP 
plasmid lacking the ITR sequences (Figures 7A and B). Additionally, the size of 
GFP positive colonies at 5 days post-transfection was three-fold larger in Hela 

20 cells transfected with p81 and p87, as compared to the pCMVGFP control vector 
(Figures 7A and B). These studies suggest the AAV circular intermediates have 
increased stability of transgene expression and substantiate findings in muscle. 

To confirm the increased molecular persistence of head-to-tail circular 
intermediates following transfection into Hela cells, total DNA (low and high 

25 molecular weight) was isolated from cultures of pCMVGFP and p81 transfected 
Hela cells at various passages post-transfection and analyzed by Southern 
blotting. Southern blots hybridized to 32 P-labeled GFP probes demonstrated a 
significantly higher level of p81 plasmid DNA at passage-7 as compared to the 
control vector lacking the head-to-tail ITR sequence (Figure 7C). The majority 

30 of signal in undigested DNA samples was associated with a 4.7 kb band 
migrating at the approximate size of the uncut monomer plasmids. Together 
with the fact that the majority of signal from all cell cultures in Figure 7C 
disappeared by passage- 10, these data suggest that these plasmids predominantly 
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remained episomal. Thus, in both muscle and Hela cells, increased persistence 
of AAV circular intermediates is correlated with stable transgene expression. 
ITR arrays are responsible for increased persistence, 

To investigate whether the head-to-tail ITR DNA element was 
5 responsible for the increased persistence of circular intermediates, we cloned this 
DNA element into a secondary luciferase vector (pGL3) to give rise to 
pGL3(ITR). Transient transfection experiments in Hela cells demonstrated a 
five-fold increase in the persistence of luciferase expression in serially-passaged 
cultures at 10 days in pGL3(TTR) as compared to that of pGL3 transfected 

1 0 (Figure 7D). These findings support the hypothesis that the head-to-tail ITR 
DNA element contained within circular intermediates is responsible for 
mediating the increased persistence of transgene expression and suggest a 
mechanism by which these molecular intermediates may confer stability to AAV 
genomes in vivo. Furthermore, increases in the stability of transgene expression 

15 conferred by this element appear to be primarily context independent, since the 
head-to-tail ITR element was 3' to the luciferase gene in pGL3(ITR) and 5' to 
the GFP transgene in AAV circular intermediates. 
Discussion 

Characterization of integrated proviral structures in different cell lines 
20 has demonstrated head-to-tail genomes as the predominant structural forms for 
both wild type and recombinant AAV (McLaughlin et al., 1988; Cheung et aL, 
1980; Duan et al., 1997). This is in contrast to the head-to-head and tail-to-tail 
structures observed in AAV replication intermediates (Rim and Rfd). Both Rfin 
and Rfd configurations have also been demonstrated in rAAV infected cells and 
25 enhanced conversion of ssAA V genomes to double stranded Rfin and Rfd forms 
has been suggested as a mechanism for augmentation of rAAV transduction by 
adenovirus in cell lines (Ferrari et al., 1996; Fisher et aL, 1996). However, it is 
plausible that the mechanisms responsible for the formation of Rfin and Rfd 
molecules are different from pathways which lead to long-term transgene 
30 expression. In support of this hypothesis is a recent study evaluating 

augmentation of rAAV transgene expression by adenovirus in liver (Snyder et 
al., 1997). These studies have demonstrated that co-infection of the liver with 
adenovirus and rAAV enhances short term transgene expression while long term 
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expression was no different than rAAV alone. The exact mechanism for the 
formation of head-to-tail circular intermediates is not clear, however similar 
structures have been demonstrated to act as pre-integration intermediates for 
retrovirus (Varmus, 1982). In this regard, circularized retroviral genomes with 
5 one and two viral LTRs have been proposed. In addition, circular pre-integration 
intermediates have also been suggested by recent studies on wtAAV integration 
(Linden et al, 1996b). The demonstration that circular intermediates exist in 
rAAV infected muscle explains several features of latent phase infection with 
rAAV vectors including proviral structure and stable episomal persistence. 

1 0 Previous studies have suggested that rAAV genomes delivered to muscle 

might persist as head-to-tail concatamers (Herzog et aL, 1997; Clark et aL, 1997; 
Fisher et aL, 1997). However, it is currently unknown whether these 
concatamers exist as free episomes or as integrated proviruses in the host 
genome. The results described above, i.e., demonstrating prolonged persistence 

1 5 of head-to-tail circular intermediates at 80 days post-infection, suggest that a 
large percentage of rAAV genomes may remain episomal. The conversion of 
monomer circularized genomes to larger circularized multimers appears to be an 
aspect associated with long term persistence and likely represents 
recombinational events between monomer intermediates. Although the bacterial 

20 rescue strategy was not capable of satisfactorily addressing the size of multimers, 
our modified approach to size fractionating Hirt DNA prior to bacterial rescue of 
intermediates lends support to this hypothesis. Additional supportive evidence 
for increased recombination over time is the finding that greater variability in the 
length of ITR arrays was observed at longer time points post-infection. For 

25 example, at 5-22 days the majority of circular intermediates contained 2 ITRs in 
a head-to-tail fashion: This is in contrast to 80 day time points where the lengths 
of ITR arrays ranges from 1-3 ITRs. Such diversity of ITR arrays in muscle 
infected with AAV has been previously found using PCR approaches (Herzog et 
al., 1997; Fisher et al., 1997). In addition, the 30% decline in the abundance of 

30 circular intermediates in muscle between 22 and 80 days also supports a 

hypothesis that these molecular forms of AAV may represent pre-integration 
complexes. 



57 



WO 99/60146 



PCT/US99/1II97 



Given the fact that circular intermediates had long terra persistence in 
muscle, certain structural features of these intermediates may affect episomal 
stability of DNA. Previous studies have noted increased persistence of transgene 
expression from plasmids encoding AAV ITRs (Philip et aL, 1994; Vieweg et 
5 al. f 1995). However, the physiologic significance of this finding has remained 
elusive. The present study, demonstrating the head-to-tail ITR arrays isolated 
from AAV circular intermediates can confer increased episomal persistence to 
plasmids following transfection in cell lines, gives a mechanistic framework for 
ITR effects on plasmid persistence. Furthermore, the correlation that AAV 

1 0 circular intermediates have increased persistence in cell lines in vitro, lends 
support to the hypothesis that these structures represent stable episomal forms 
following rAAV transduction in muscle. Stability of circular intermediates in 
vivo might be mediated by the binding of cellular factors to "Holliday-like" 
junctions in ITR arrays which stabilize or protect DNA from degradation. 

1 5 rAAV has been shown to be an efficient vector for expressing transgenes 

in various tissues in addition to muscle, such as brain, retina, liver, lung, and 
hematopoetic cells (Snyder et aL, 1997; Muzyczka, 1992; Kaplitt et aL, 1994; 
Walsh et aL, 1994; Halbert et aL, 1997; Koeberl et aL, 1997; Conrad et aL, 1996; 
Bennett et aL, 1997; Flannery et aL, 1997). Despite these advances in the 

20 application of rAAV, the mechanisms of in vivo rAAV-mediated transduction 
and persistence of transgene expression still remain unclear. Such questions as 
to the molecular state of rAAV following in vivo delivery is highly relevant to 
the clinical application of this viral vector. For example, should rAAV primarily 
persist as an randomly integrated provirus, the potential for insertional 

25 mutagenesis could present a major theoretical obstacle in the use of this vector 
due to the potential for mutational oncogenesis. The demonstration that rAAV 
can persist as episomes suggest that random integration and associated risks of 
malignancy may not be a major concern for this viral vector system. 
Additionally, the molecular determinants of AAV circular intermediates 

30 associated with increased persistence in cell lines appear to be contained within 
the DNA elements encompassing the inverted ITRs. The isolation of this 
naturally occurring viral DNA element, which forms as part of the AAV life 
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cycle and acts to stabilize circular episomal DNA, may prove useful in 
increasing the efficacy of both viral and non- viral gene therapy vectors. 

Example 3 

Evidence for Tnrxeas ft d Fpisomal Persistence of AAV Circular intermedi a tes in R 
5 Model for in utern Plasmid-ftased Gene Therapy 

Persistence of AAV circular intermediates were assessed by injection of 
plasmid DNA directly into the pronucleus of fertilized Xenopus oocytes. 
Twenty-five ng of the p81 isolate of AAV circular intermediates was injected at 
the single cell stage of fertilized Xenopus oocytes. This plasmid was compared 

10 to the proviral plasmid pCisAV.GFP3ori, which contains two ITRs separated by 
stuffer sequence in an alternative confirmation to ITRs in p81 , Figure 13 depicts 
the persistence of GFP plasmids as assessed by direct fluorescence of GFP. At 
this state of tadpole development, the fertilized oocyte has expanded from a 
single cell to approximately 10 6 cells. 

1 5 < These studies confirm that AAV circular intermediates (p8 1 ) confer a 

higher level of stability in development Xenopus oocytes than plasmids 
containing similar transcriptional elements and ITR sequences in an alternative 
confirmation. Given that in the case of p81 injected oocytes, tadpoles are 
completely fluorescent, the data suggests that some level of integration may have 

20 occurred 

Example 4 

Liposome Mediated Transfer nf Vectors o f the Invention to the Airway and 
Muscle 

Studies evaluating the mechanisms of recombinant adeno-associated 
25 virus (AAV) transduction have identified a novel molecular intermediate 
responsible for episomal persistence. This intermediate is characterized by a 
circularized AAV genome with head-to-tail ITR repeats. Circular intermediates 
of rAAV were identified using a recombinant shuttle vector capable of 
propagating circularized viral genomes in bacteria. Pivotal experiments in cell 
30 lines demonstrate that the formation and persistence of these circular 

intermediates are augmented in the presence of helper adenovirus. These 
findings suggest that cellular factors induced by adenoviral gene expression may 
modulate both the formation and/or persistence of AAV circular intermediates. 
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Furthermore, studies in muscle have demonstrated that following rAAV 
infection, the formation and persistence of AAV circular intermediates correlates 
with the onset and maintenance (at 80 days) of transgene expression, 
respectively. Moreover, a 300 bp fragment encompassing the head-to-tail 

5 inverted ITR repeats found in AAV circular intermediates when cloned into 
heterologous expression plasmids can confer increased stability to those 
plasmids in HeLa cells. The structural aspects of AAV circular intermediates 
may lead to development of non- viral, plasmid based, gene transfer vectors with 
increased persistence of transgene expression. 

10 To determine whether AAV circular intermediates which differ in length 

and/or sequence of the ITR array are more efficacious plasmid based vectors for 
liposome-mediated gene transfer to the airway and muscle, several distinct forms 
of AAV circular intermediates are evaluated as plasmid-based delivery systems 
in three model systems of the airway including: 1) in vitro polarized primary 

15 airway epithelial monolayers, 2) mouse lung, and 3) human bronchial 

xenografts. Persistence is evaluated at both the level of transgene expression 
(using GFP and luciferase reporters) and at the level of episomal and integrated 
transgene derived DNA. Studies are performed to assess whether integration can 
be specifically enhanced by co-transfection with Rep DNA or raRNA. These 

20 studies also evaluate both the extent of integration and site specificity to AVS1 
sites in chromosome 19 of human model systems. 

Gene therapy using plasmid-based delivery systems have encountered 
several obstacles to efficient transgene expression These obstacles include 
transient expression of transgenes and rapid degradation of DNA. In contrast, 

25 viruses have developed efficient mechanisms for transducing cells and 

expressing encoded viral genes. The molecular characteristics of AAV circular 
intermediates which confer increased persistence of transgene expression include 
a DNA element encompassing the head-to-tail ITR. Based on the findings that 
circular intermediates have increased episomal persistence in muscle following 

30 rAAV transduction, these structures may also have increased persistence as 
plasmid-based vehicles to the airway. Interestingly, several naturally occurring 
mutations which are found in approximately 50% of AAV circular intermediates 
affect the stability of the intermediate. 
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Several findings evaluating the efficiency of AAV circular intermediate 
formation from recombinant viral vectors have suggested that these structures 
are augmented in abundance by the presence of the E2a adenoviral gene product. 
These molecular structures may represent preintegration intermediates which, in 

5 the case of wild-type AAV, would efficiently integrate into the cellular genome 
by Rep facilitated mechanisms. However, in the case of recombinant AAV 
genomes (in the absence of Rep proteins), evidence suggests that these structures 
have increased episomal stability. To test whether exogenous addition of Rep 
and/or E2a can increase the efficacy of AAV circular intermediates by 

10 modulating their stability and/or integration, co-transfection methods with Rep 
encoding plasmids and mRNA are conducted. Additionally, exogenously 
supplied E2a DNA binding protein (DBP) may also enhance stability of AAV 
circular intermediates. Rep may increase the integration of circular 
intermediates while E2a may increase their episomal stability. Several 

1 5 observations including the association of E2a DBP with AAV genomes in the 
nucleus support a direct interaction between DBP and AAV circular 
intermediates. Furthermore, if DBP associates with AAV circular intermediates, 
its encoded nuclear localization sequence (NLS) may enhance nuclear 
sequestration of these plasmids in the nucleus. Alternatively, E2a may act to 

20 alter the persistence of AAV circular intermediates through the induction of 
cellular factors which interact with the ITR array. 

Liposome mediated gene transfer to the airway has considerable 
advantages due to the low level of toxicity. However, limitations include 
transient low level expression in differentiated airway epithelia. Despite this 

25 apparent limitation, several laboratories have had considerable success with the 
use of cationic liposome-mediated gene transfer in several animal models 
including mouse and rat lung, and numerous laboratories have pursued clinical 
trials, which suggested that these vehicles may show promise for gene therapy of 
the cystic fibrosis (CF) lung. Thus, delivery of the present vectors in plasmid 

30 form via liposomes may be a safe and effective vehicle for gene transfer to the 
airway. 

To assess whether AAV circular intermediates may also have increased 
persistence in airway epithelial cells as seen in Hela cells, several distinct forms 
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of circular intermediates delivered by liposome-mediated transfection into 
primary airway epithelial cells, are evaluated. Based on the diversity of ITR 
repeat elements between various isolated circular intermediates (i.e., including 0, 
1, 2, and 3 ITRs), circular intermediates isolated from later time points in muscle 
5 may have been naturally selected for increased stability in vivo. Hence, the 

structural consistencies between AAV circular intermediates are identified which 
give increased persistence as plasmid based vectors for gene transfer. 

Circular intermediates containing the GFP reporter gene and 1, 2, and 3 
ITRs are transfected into primary airway cultures and polarized epithelial cell 

1 0 monolayers using the cationic lipid GL-67 (Genzyme Inc.). DNA to lipid ratios 
are optimized using a luciferase reporter. Additionally, the addition of EGTA, or 
the use of calcium-free media, can increase the extent of gene transfer about 10- 
fold, and may be included to enhance gene transfer to polarized epithelial 
monolayers. To evaluate persistence and expression of transgenes from circular 

1 5 intermediates, direct fluorescent microscopy and Southern blotting of both Hirt 
and genomic DNA with GFP P^-labeled probes are utilized. Proliferating 
cultures of primary airway epithelial cells can be passaged up to 4 times during 
this analysis. In contrast, polarized epithelial monolayers are evaluated at 1 
week intervals for DNA persistence for up to 6 weeks. Since GFP transgene 

20 expression may be low and difficult to detect by direct fluorescence, GFP is 
quantitated by fluorometer of cell lysates. 

Following AAV transduction, circular intermediates may form within 
cells and certain structures of these intermediates may persist by virtue of 
affinity for cellular factors which bind at ITR arrays. If this is true, then it may 

25 be possible to select for and isolate optimal circular intermediates with increased 
persistence in airway cells by batch screening of circular intermediates pools 
from rAAV infected airway epithelia. 

Primary airway epithelia cell cultures are infected with AV. GFP3ori 
(MOIs of 1000 to 10,000 DNA part/cell) and low molecular Hirt DNA is 

30 prepared at 5- 1 5 days post-infection. Hirt DNA containing circular 

intermediates from rAAV infected cells is used to then transfect primary airway 
epithelial cells from which Hirt DNA is prepared at 5-15 days post-transfection. 
This second Hirt isolation is then used to isolate replication competent plasmids 
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following transformation into bacteria. This selection process may give rise to 
those populations of circular intermediates with increased episomal persistence 
in airway epithelial cells. Selected clones of circular intermediate plasmids 
isolated by this procedure are then tested individually for increased persistence 

5 following liposome mediated transfection. These studies are performed in a 
batch type screening in 24 well plates using two serial passages for persistence. 
Once plasmids having increased persistence are isolated, their structure and 
sequence of ITR arrays are characterized. Since screening is performed on 
small-scale cultures, it may be necessary to implement semi-quantitative 

10 screening for DNA persistence within the first round of transfection using PCR 
methods. Candidate plasmids with a high level of increased persistence as 
compared to control plasmids which lack ITR sequences but contain the identical 
promoter-reporter element, are evaluated on a larger scale transfection amenable 
to analysis by Southern blotting of total DNA. 

1 5 To evaluate selected circular intermediate structures in vivo, two models 

including mouse lung and the human bronchial xenograft are employed. 10 wk 
BalbC mice are transfected with GL-67/DNA complexes at a ratio of 25 ng 
plasmid/25 \ig lipid in an iso-osmotic solution of Dextrose. At 1, 5, 10, 15, and 
20 days post-transfection lungs of mice are harvested for immunofluorescent 

20 detection of GFP in formalin fixed sections and for quantitative fluorometiy of 
tissue lysates. Southern blots are employed to evaluate the persistence of 
plasmids in Hirt and genomic DNA. In addition to evaluating the persistence of 
selected circular intermediates which have the highest level of persistence with 
in vitro models, luciferase constructs are evaluated in which the ITR array has 

25 been cloned either 5 ' or 3 ' to the reporter gene. Furthermore, the use of 

luciferase reporters allows for more sensitive assessment of transgene activity in 
cell lysates. 

Similarly, in vivo persistence of transfected circular intermediates and 
heterologous plasmids containing ITR arrays found within circular intermediates 
30 is evaluated in human bronchial xenografts. 

Findings evaluating the effects of adenoviral co-infection on circular 
intermediate formation and persistence have suggested that E2a DBP leads to a 
10-fold increase in the abundance of circular intermediates as compared to E2 
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deleted virus. Furthermore, studies with El-deleted virus have demonstrated that 
the persistence of circular intermediates in Hela cells is increased at 72 hours 
post-infection. These studies suggest that E2a DBP may augment circular 
intermediate formation and/or increase the stability of these structures by an 
5 unknown mechanism. E2a DBP may interact directly with circularized genomes 
and/or induce cellular factors which interact with sequences in these AAV 
genomes. Since DBP encodes an NLS, this protein may act to shuttle circular 
intermediates to regions of nucleus that allow for increased stability of these 
structures. NLS sequences have been shown to cooperatively interact with 

10 nucleolar targeting sequences and hence we will also evaluate if subnuclear 
targeting is important in maintaining the increased stability of circular 
intermediates containing ITR arrays. Furthermore, it is currently unknown 
where circular intermediates form in the cell and it remains plausible that they 
may form in the cytoplasm or nucleus. Hence, if DBP associates directly with 

1 5 circular intermediates, it may act as an NLS for DNA to enter the nucleus as 
well. 

Several in vitro reconstitution models are used to investigate the 
interaction of circular intermediates with DBP and their affect on in vivo 
persistence following DNA transfection in Hela cells. Furthermore, results 

20 evaluating the affects of various mutant adenoviral vectors on circular 

intermediate and Rfin/Rfd formation have suggested that these two types of 
intermediates occur by independent pathways indicative of latent and lytic 
infection, respectively. In the setting of wild type AAV, circular intermediates 
may be pre-integration complexes, which in the presence of Rep, efficiently 

25 integrate into the host genome. In contrast, in the absence of Rep, circular 
intermediates may accumulate episomally in rAAV infected cells. To this end, 
methods of supplementing Rep function may be capable of enhancing integration 
of plasmid based delivery of AAV circular intermediates. Thus, experiments in 
which co-transfection of circular intermediate plasmids with Rep expression 

30 plasmids or mRNA are conducted. 

To investigate whether DBP can augment the stability of circular 
intermediates by increasing targeting to the nucleus, a Hela cell line (gmDBP6) 
is utilized which encodes an inducible E2a gene under a dexamethasone 
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responsible element. This cell line gives rise to high levels of DBP in nuclear 
extracts by Western blot following treatment with dexamethasone. gmDBP6 
cells (+/- DEX) are transfected with various AAV circular intermediate plasmids 
containing 0, 1, 2, and 3 ITRs and total cellular and nuclear plasmid content 
5 evaluated by subcellular fractionation using Southern blotting against GFP 
probes. The time course of these studies is initially within the range of 12 hours 
to 4 days post-transfection. Transgene expression is evaluated by fluorometry 
(in cell lysates), and fluorescent microscopy (in viable cells), for GFP and 
luminescence for luciferase. Hela cells have demonstrated that immediate 
10 increases in transgene expression from AAV GFP circular intermediates as 

compared to control GFP plasmids occur as early as 24 hours post-transfection. 
Thus, certain cellular factors may facilitate an immediate accumulation of 
circular intermediates in the nucleus. DBP may invoke this increase by either 
direct interactions with ITR sequences or by the induction of cellular factors. To 
1 5 evaluate the potential for direct interactions between DBP and circular 

intermediates, various form of ITR arrays found within circular intermediates are 
end-labeled with y-ATP 32 and evaluated for binding by electrophoretic mobility 
shift assays to nuclear extracts from gmDBP6 cells (+/- DEX). Supershifts, with 
DBP antibodies and competition experiments with cold ITR sequences and non- 
20 specific DNA, are used as controls for specific binding. 

In a second model system aimed at evaluating the potential of DBP for 
shuttling and/or sequestering of circular intermediates to the nucleus, 
microinjection experiments in oocytes are performed with 50 ng of plasmid 
DNA of circular intermediates with and without 50 ng of DBP mRNA. 
25 Experiments initially evaluate the time course of GFP transgene expression (+/- 
DBP cRNA) by direct fluorescent microscopy. If major differences are seen, 
quantitative fluorometry of individual whole oocytes in 96 well plates is 
conducted. Similar studies on nuclear targeting in the presence of DBP can also 
be evaluated in this model by pooling microinjected oocytes for nuclear isolation 
30 and Southern blot analysis. 

A third experimental model to evaluate nuclear targeting and/or 
accumulation of circular intermediate vectors in the presence and absence of 
DBP involves the microinjection of fluorescently labeled plasmid DNA into the 
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cytoplasm and real time imaging to follow the nuclear accumulation of DNA. 
The DNA fluorescent dye, TOTO-1, is used to label DNA prior to injection. 
This dye forms an extremely stable complex with negligible diffusion and re- 
incorporation into nuclear DNA following transfection into polarized airway 
5 epithelial cell monolayers. Co-localization of DBP with wtAAV DNA genomes 
at focal hot spots within the nucleus supports the observation that nucleolar 
targeting may be important for persistence. These experiments are also 
performed in primary airway epithelial cells and in vivo models of the airway by 
either co-transfection of circular intermediates with DBP expressing plasmids 

10 and/or mRN A. 

The effects of Rep co-transfection on the integration of circular 
intermediate plasmids is also evaluated. Two methods are used to express Rep 
including: 1) co-transfection with Rep expressing plasmids, and 2) co- 
transfection with Rep encoding mRNA. Initially, Hela, CFT1, and IB-3 cells are 

1 5 tested, as transformed cells may be more amenable to expansion and evaluation 
of integration. Both CFT1 and IB-3 cells represent airway epithelial cells. 
Experiments are performed by cationic liposome (GL-67) mediated transfection 
of circular intermediate DNA with varying doses of a Rep-containing expression 
vector, e.g., pCMVRep. The extent of integration is also evaluated by two 

20 criteria, Southern blotting of Hirt and genomic DNA and clonal expansion of 
GFP expressing cells. Since Southern blot has an approximate limit of 
sensitivity of 1 integrated plasmid molecule per 10 cellular genomes, clonal 
expansion may be necessary to evaluate persistence in less transfectable cells 
such as CFT1 and EB-3 cells. Cell lines are evaluated over the course of 1-10 

25 passages. 

Sustained expression of Rep by plasmid mediated co-transfection may be 
toxic to cells, hence co-transfection with Rep mRNA is also evaluated. Cationic 
liposome:mRNA mediated transfection has been previously shown to work in 
cell lines and although the level of expression is much more transient than for 
30 DNA, in these studies it may be an advantage. Initial studies are performed with 
in vitro transcribed Rep mRNA alone to evaluate the jig amount of mRNA 
needed for Rep expression as determined by Western blot. Once the threshold 
for detectable Rep expression is established, increasing amounts of Rep mRNA 
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are co-transfected with circular intermediate DNA. Similar assays are used as 
described above to evaluate the extent of AAV circular intermediate integration. 
If findings suggest that increased integration if facilitated by Rep, the site 
specificity of this integration can be evaluated by cloning GFP expressing cells 

5 after the 10th passage by serial dilution. These GFP expressing clones are 
expanded and genomic Southern blots assessed with both GFP and AVS1 
specific probes. By evaluating a number of restriction enzymes which either do 
not cut or cut once within the circular intermediate plasmid, it will be determined 
whether integration has occurred at the AVS1 loci. 

1 0 To test whether secondary structure rather than primary sequence is the 

important determinant of increased episomal stability of AAV circular 
intermediates, synthetic DNA sequences are generated with identical secondary 
structure to several ITR arrays in circular intermediates. The primary sequence 
is completely altered and bares no resemblance to sequences contained within 

1 5 native AAV ITRs. These synthetic DNA sequences are tested for their ability to 
confer increased episomal stability to heterologous plasmids in several model 
systems including: 1 ) the airway, 2) muscle, 3) and developing Xenopus 
embryos. The developing Xenopus embryo model is ideal for testing integration 
and persistence of plasmid based vectors for application of in utero gene therapy. 

20 If synthetic DNA sequences with similar secondary structure to ITRs are found 
to confer increased persistence to plasmid based vectors, then determinants for 
protein binding which facilitate persistence are independent of primary base 
sequence. These studies allow the optimization of the secondary structural 
requirements by synthesizing a wide range of DNA molecules with varying 

25 degrees of palindromic repeats. Furthermore, the secondary structure may not 
bind proteins directly but facilitate recombination of plasmids to large 
concatamers which have increased episomal stability or enhanced integration 
efficiencies. 

Example 5 

30 Delivery of Multiple Genes through Intermolecular Concatamerization 

Methods 

Recombinant AAV vectors. 



67 



WO 99/60146 



PCT/US99/11I97 



Two rAAV vector stocks were generated for use in these studies, 
AV.GFP3ori (Example 1) and AV.Alkphos (also known as CWRAPSP, a gift of 
Dusty Miller) (Halbert et aL, 1997). Virus stocks were generated by co- 
transfection of 293 cells with either pCisAV.GFP3ori or pCWRAPSP along with 
5 pRep/Cap, followed by co-infection with recombinant AdCMVlacZ helper virus 
(Example 2). rAAV was then purified through three rounds of CsCl density 
gradient centrifugation as previously described by Duan et aL (1997). Purified 
viral fractions were heated at 60°C for 1 hour to inactivate any residual 
contaminating helper adenovirus. The yields for AV.GFP3ori and AV.Alkphos 

10 were 1 x 10 12 and 7 x 10 u particles per ml, respectively, as determined by slot 
blot hybridization with 32 P-Iabeled GFP or Alkphos probes. Infectious titers 
determined by infection of 293 cells with rAAVs were 1.1 x 10 9 IU/ml 
(AV.GFP3ori) or 8.6 x 10* IU/ml (AVAlkphos). Controls testing for 
contamination of rAAV stocks with wtAAV by anti-Rep immunocytochemical 

1 5 staining jn rAAV/Ad.CMVlacZ co-infected 293 cells were negative (limit of 
sensitivity is less than 1 infectious wtAAV particle per 10 10 DNA particles of 
rAAV). Similarly, histochemical staining for p-galactosidase in rAAV infected 
293 cells showed no detectable contamination with helper adenovirus in 10 10 
DNA particles of rAAV (limit of sensitivity). 

20 Infection of muscle tis sne and evaluation of ttansgene expression. 

The C57BL/6 mice used for these experiments were housed in a virus- 
free animal care facility and were maintained under strict University of Iowa and 
NIH guidelines, using a protocol approved by the Animal Care and Use 
Committee and facility veterinarians. Four to five week old mice received 

25 bilateral 30 \il injections of a mixture of both AV.GFP3ori and AV. Alkphos into 
the tibialis anterior muscle (5 x 10 9 DNA particles of each virus per muscle). 
Controls included uninjected muscles and muscles receiving injections of one of 
the viruses alone. At 14, 35, 80, and 120 days post-infection, animals were 
euthanized and tissues were harvested for evaluation of transgene expression and 

30 preparation of low molecular weight Hirt DNA. For each experimental time 
point, at least 3 independently injected muscles were evaluated. 

In all experiments, GFP fluorescence was visualized in freshly excised 
muscle tissue prior to processing. A portion of the same muscle was fixed with 
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2% paraformaldehyde in phosphate buffered saline, and cryoprotected in graded 
sucrose solutions before embedding in optimal cutting temperature medium 
(OCT). Sections (6 jim) were then evaluated for GFP expression directly and 
Alkphos expression following heat inactivation of endogenous Alkphos and 
5 histochemical staining for Alkphos activity (Engelhardt et aL, 1995). To confirm 
dual localization of GFP and Alkphos expression in the same muscle fibers, 
either serial sections were evaluated for GFP and Alkphos expression or the 
same section was first photographed for GFP expression followed by 
histochemical staining for Alkphos and re-imaging of the same field. 

10 Rescue of circular intermediates from muscle Hirt DNA . 

Low molecular weight Hirt DNA was prepared from 20 mg specimens of 
injected muscles from 3 animals at each time point (Example 2). Hirt DNA 
(4 jil; 1/5 of the total volume) was then used to transform 50 ^1 of 
electrocompetent SURE cells (Stratagene) using a BioRad E. coli electroporater 

15 and 0. 1 p cuvettes. Colonies resulting from each bacterial transformation were 
quantified, and plasmids from 20 colonies from each muscle Hirt DNA sample 
were purified for analysis. It should be noted that only circular forms carrying 
the Amp resistance gene and the bacterial origin of replication from AV.GFP3ori 
are rescued by bacterial transformation (Duan et aL, 1998). Control experiments 

20 reconstituting 5 x 1 0 10 viral DNA particles into uninfected muscle extracts prior 
to Hirt DNA preparation failed to give rise to replication competent plasmids in 
the rescue assay (Duan et ah, 1998). Additional controls in Duan et al. (1998) 
using AV.GFP3ori virus also demonstrated that linear double stranded and single 
stranded purified viral DNA genomes do not give rise to replication competent 

25 plasmids following transformation into E colL 

Characterization of encoded genes in rescued circular intermediates. 

Several assays were used to characterize the extent of intermolecular 
recombination between independent circular viral genomes by evaluating the 
number and type of encoded genes in rescued plasmids from Hirt DNA of 

30 muscles co-infected with AV.GFP3ori and AV.Alkphos. Initial analysis 
involved the bulk evaluation of 60 rescued plasmids (20 from each of three 
muscle samples for each time point) by dot blot hybridization of mini-prep DNA 
with EGFP, Alkphos, and Amp M P-labeled DNA probes. In these studies, Amp 
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hybridization served as a control to show that there was a sufficient quantity of 
DNA for the analysis. The percentages of AUcphos and/or GFP hybridizing 
plasmids were calculated by this method for each muscle sample. From this 
percentage, the total number of plasmids hybridizing to each probe in the Hirt 
5 DNA sample was calculated from the total CFU obtained in each transformation. 
In this analysis, each muscle sample was evaluated independently to determine 
the mean (+/-SEM) total Alkphos and/or GFP hybridizing plasmids. A second 
evaluation involved the transfection of rescued plasmids into 293 cells using 
lipofectamine, followed by evaluation of GFP fluorescence and histochemical 

1 0 staining for Alkphos. To confirm that GFP and Alkphos co-expressing plasmids 
were indeed clonal and that both genes were encoded on the same plasmid, a 
selected group of five co-expressing plasmids were retransformed into E.coli and 
colonies were re-isolated prior to repeating the transfection studies. In all cases, 
plasmids co-expressing the two reporter genes remained clonal through this 

1 5 subsequent re-isolation. 

Structural analysis of concatamer rAAV circular intermediates. 

To further characterize the nature of isolated circular intermediates co- 
expressing both GFP and Alkphos transgenes, plasmid structure was mapped by 
Southern blotting and restriction enzyme analysis. The structural of five co- 

20 expressing circular intermediate plasmids were determined by digestion with 
Ahdl, Hindin, NotI, HindEI/Notl, Clal/Asel, and/or SnaBI and Southern 
blotting was performed with * 2 P-labeled GFP, Alkphos, and ITR probes. 
Results 

Strategy for characterising mechanisms of rAAV circular intermediate 
25 formati on. 

Efficient circularization of rAAV genomes has been previously 
demonstrated to occur in muscle in a time dependent fashion (Example 2). 
Furthermore, the conversion of monomelic to multimeric circular rAAV 
intermediates occurred over time and was associated with long-term episomal 
30 persistence of AAV genomes. High molecular weight AAV circular genomes 
might form by either of the following two mechanisms, one involving the 
replication of monomer structures and the other through intermolecular 
recombination between independent monomers. A rescue assay was developed 
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using two separate rAAV vectors, AV.GFP3ori and AV.Alkphos (Figure 14A), 
which allowed for the identification of independent viral genomes through 
unique transgenes. In this assay, circular form genomes were rescued in bacteria 
by virtue of Amp/ori sequences encoded in one of the two vectors 
5 (AV.GFP3ori). A method for characterizing the extent of intermolecular 
recombination between independent circular rAAV genomes was shown in 
Figure 14B. 

Co-expression of ind ependently enenried rAAV transgenes in muscle mvofihexs. 
To confirm that myofibers can be co-infected at a high efficiency with 

1 0 the two rAAV vectors, the tibialis anterior muscle of mice was co-infected with 
5 x 10* DNA particles of both AV.GFP3ori and AV.Alkphos. At 14, 35, 80, and 
120 days post-infection, muscles were harvested and analyzed for transgene 
expression. Transgene expression from both reporters was weak but clearly 
visible in 14 day muscle samples. By 80 days post-infection, transgene 

1 5 expression was maximal and serial sections demonstrated expression of both 
Alkphos and GFP transgenes in overlapping regions of the muscle (Figures 15A- 
C). At this time point, approximately 50% of the fibers in the tibialis muscle 
expressed both transgenes. To confirm that co-infection of myofibers occurred 
with the two independent vectors, co-localization studies were performed on 

20 muscle sections by a serial staining procedure. These studies, depicted in Figure 
15D, demonstrate four classes of myofiber transgene expression: 1) GFP positive 
only, 2) Alkphos positive only, 3) GFP/Alkphos positive, and 4) no transgene 
expression. The largest fraction of myofibers expressed both GFP and Alkphos 
transgenes. These results confirm that at the titers of virus used for infection, co- 

25 infection occurred in greater than 90% of transgene expressing myofibers. 
Rescue nf hi-functional rAAV circular intermediates increases over time . 

To determine the extent of recombination between circular AAV 
genomes, circular form genomes were rescued as plasmids from low molecular 
weight Hirt DNA of muscle tissue co-infected with AV.GFP3ori and 

30 AV.Alkphos. Following transformation of Kcoli Sure cells with Hirt DNA 
purified from infected muscles, the total number of GFP and Alkphos 
hybridizing Amp resistant bacterial plasmids was quantitated for each time point 
post-infection (Figure 16A and B) (Duan et aL, 1995), the abundance of circular 
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AAV genomes rescued from AV.GFP3ori increased over time. For each muscle 
sample (three for each time point) twenty plasmid clones were evaluated for 
hybridization to GFP and Alkphos DNA probes and the total number of plasmids 
was back calculated from the total CFU for each individual muscle sample. 

5 Figure 16B demonstrates the mean (+/-SEM, N=3) total plasmids that hybridized 
to GFP or GFP/Alkphos probes at each time point At 14 days post-infection, 
GFP/ Alkphos co-hybridizing plasmids were never observed. In contrast, at time 
points after 35 days the percentage of GFP/Alkphos co-hybridizing plasmids 
increased with time and reached 33% by 120 days (Figure 16C). Since bacterial 

10 plasmid rescue can only occur through AV.GFP3ori genomes, this data suggests 
that recombination between independent Alkphos and GFP rAAV genomes takes 
place over time. These results are consistent with studies described hereinabove 
demonstrating a time dependent concatamerization of monomer circular rAAV 
genomes in muscle. 

1 5 To evaluate the ability of circular intermediates to express encoded 

transgenes, transient transfection studies were performed in 293 cells with 
rescued circular intermediate plasmids (Figures 17A-C). Between 85-90% of 
rescued plasmids hybridizing to GFP probes on slot blots also expressed the GFP 
transgene in this transfection assay (Figure 17D). The percentage of GFP 

20 expressing plasmids that also expressed Alkphos rose over time in concordance 
with the hybridization data (Figure 17D). However, approximately 40-50% of 
plasmids which were hybridization positive for Alkphos did not express the 
Alkphos transgene. This may represent recombinational deletion of the RSV 
promoter driving Alkphos expression which occurred during concatamerization 

25 at sites near the 5' ITR. These results demonstrate that intermolecular 

recombination between Alkphos and GFP derived circular intermediates occurs 
as part of the time dependent concatamerization process of rAAV in muscle. To 
confirm that amplified plasmids stocks expressing both reporter genes were 
actually clonal (i.e., one plasmid rather than two independent plasmids resulting 

30 from contamination), a select number of bacterial clones expressing both 
transgenes were re-isolated and the transfection assays were repeated. In all 
cases, plasmids expressing the two reporter genes remained clonal through two 
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rounds of bacterial cloning. Hence, dual reporter expression was not due to 
contamination of independent GFP and Alkphos expressing plasmids. 
Tnnratamerization o f A A V circular intermediates occurs through uniform 
mtPrmnWular rprom h inarinn between TTRs of indpmftnrifMlt viral gnomes . 

5 To better understand the mechanisms of circular concatamer formation, a 

detailed structural analysis was performed of five bi-functional circular 
concatamers isolated from rAAV infected muscle samples. As previously 
described for the AV.GFP3ori genome (Example 2), the conversion of 
monomelic circular AAV genomes to large multimeric circular concatamers with 

1 0 a predominant head-to-tail structure increased with time in muscle. To evaluate 
the structure of bi-fiinctional circular concatamers, restriction enzyme mapping 
and Southern blot analysis using 32 P-labeled EGFP, Alkphos, and ITR probes 
was employed. Results from five analyzed plasmids demonstrated between 3-6 
genomes within these circular concatamers. Two representative structures from 

15 35 and 80 day time points are shown in Figure 18. Several interesting 

conclusions can be made from this structural analysis. As described , head-to- 
tail oriented genomes could be seen in all isolated concatamers. However, 
several examples of head-to-head and tail-to-tail genome combinations of 
AV.Alkphos and AV.GFP3ori were also seen. Since head-to-head and tail-to- 

20 tail genome concatamers were never seen in muscles infected with AV.GFP3ori 
alone, there must be a selective disadvantage for bacterial replication when ori 
sequences are in either of these conformations. However, since the AV.Alkphos 
genomes do not contain a bacterial origin of replication, this orientation is 
permitted in chimeric concatamers. Second, noticeable deletions and/or loss of 

25 restriction sites close to ITRs were noted (Figure 1 7). It is not known whether 
deletions close to the ITR are a common event in the concatamerization process, 
but if so, this could account for the fact that only 60% of GFP/Alkphos 
hybridizing circular intermediates also expressed the Alkphos transgene. 
Discussion 

30 Concatamerization of rAAV genomes has long been recognized in 

integrated proviral genomes. Recently, the association of this concatamerization 
process with the formation of high molecular circular genomes in muscle has 
suggested that this process may also be important in episomal persistence. The 
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findings described herein demonstrated rescue of independent viral genomes 
within the same circular concatamer, suggesting that this process of 
concatamerization occurs through intermolecular recombination. Furthermore, 
at 14 days the predominant form of viral genome in muscle was circular 
5 monomers (Example 2), which correlates with the results described above 

demonstrating only GFP expression in rescued circular inteimediates at this time 
point. Together with the fact that bi-functional rescued circular concatamers 
increase with time, these results suggest that large concatamers form by 
recombination of monomelic circular precursor genomes. Furthermore, since an 
1 0 alternative model of concatamerization by rolling circular replication would be 
expected to yield only GFP expressing rescued plasmids in this system, this 
mechanism does not appear responsible for concatamerization. 

Based on the structural analysis of these bi-functional circular 
intermediates, recombination between monomeric circular rAAV genomes is 
1 5 likely facilitated through ITR sequences. Directionality of this recombinational 
event does not appear to play a significant role, since head-to-tail, head-to-head, 
and tail-to-tail oriented intermolecular concatamers were found. In addition, the 
extent to which recombination within ITR repeat regions occurs in bacteria is 
presently unknown and may account for the deletions and/or restriction site 
20 losses near ITR arrays. However, serial passaging of bi-functional circular AAV 
genomes in bacteria has suggested that the structure of these large concatamers is 
impressively stable in bacteria. 

Intermolecular recombination of rAAV genomes to form single circular 
episomes may be particularly useful for gene therapy. For example, large 
25 regulatory elements and genes beyond the packaging capacity of rAAV may 
become linked after co-infecting tissue with two independent vectors 
(Figure 19). This strategy could also involve trans-splicing vectors encoding two 
independent regions of a gene which are brought together to form an intact 
splicing unit by circular concatamerization. 
30 For example, two independent vectors encoding two halves of the CFTR 

gene flanked by donor and acceptor splice site sequences are prepared. 
Expression of functional CFTR protein results after splicing of RNA transcribed 
from a concatamerized genome comprising both halves of the gene in the sense 
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orientation. One rAAV vector may comprise the first 3.3 kb of the CFTR gene 
under the control of the RS V promoter and an in-frame splice donor site at the 3 ' 
end of the CFTR cDNA- The second rAAV vector encodes a splice acceptor 
intronic sequence, the 3' 1 .4 kb of the CFTR gene, and SV40 poly-adenylation 

5 sequences. To test for efficient splicing, a chimeric vector (pcDNA3.1 CFTR- 
Donor/Acceptor) is introduced to Xenopus oocytes by nuclear injection of the 
vector, followed by two electrode voltage (TEV) clamp recording functional 
analysis of CFTR (Jiang et al. t 1998). mRNA transcripts are also analyzed for 
correct splicing following transfection of pcDNA3.1CFTR-Donor/Acceptor into 

10 MDCK cells. Polarized airway epithelial cells grown at the air-liquid interface 
are co-infected with the donor and acceptor CFTR AAV vectors CFTR gene 
expression in these cells is then monitored by both immunofluorescent 
localization and functional analysis of short circuit currents (Smith et al., 1992; 
Smith et al., 1990). Hirt analyses of episomal AAV species are used to correlate 

1 5 the efficacy and persistence of CFTR gene expression with the formation of 
AAV circular intermediates. 
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• • ♦ 
• • 



82 



EDITORIAL NOTE - NO 40912/99 

The following Sequence listing is part of the description. 
The claims follow on page 83 to 92. 



WO 99/60146 



1 



PCT/US99/11I97 



10 



SEQUENCE LISTING 

<110> University of Iowa Research Foundation et. al . 

<120> Adeno-associated virus vectors and uses 
thereof 

<130> 875.007WO1 

<1S0> US 09/276,625 
<151> 1999-03-25 



<150> US 60/086,166 
15 <151> 1998-05-20 

<160> 7 



20 



25 



<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 20 
<212> DNA 

<213> Adeno-associated virus 

<400> 1 
cgggggtcgt tgggcggtca 



<210> 2 
30 <211> 19 

<212> DNA 

<213> Adeno-associated virus 

<400> 2 
35 gggcggagcc tatggaaaa 

<210> 3 
<211> 505 
<212> DNA 
40 <213> Artificial Sequence 
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60 
120 
180 
240 
300 
360 
420 
4B0 
505 



<220> 

<223> A consensus sequence of inverted terminal repeats 
<400> 3 

5 cgggggtcgt tgggcggtca gccaggcggg ccatttaccg taagttatgt aacgactgca 
ggcatgcaag ctcgaattca tcggtagata agtagcatgg cgggttaatc attaactaca 
aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcgctgagg 
ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc 
gagcgcgcag ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg 
10 ggcgaccttt ggtcgcccgg cctcagcgag cgagcgagcg cgcagagagg gagtggccaa 
ctccatcact aggggttcct tgtagttaat gattaacccg ccatgetact tatctacagc 
ttgcatgcat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc 
tggcgttttt ccataggctc cgccc 

15 <210> 4 

<211> 272 
<212> DNA 

<213> Adeno-associated virus circular intermediate, clone p8l 
20 <400> 4 

gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcggccaa 
aggtcgcccg acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagag 
agggagtggc caactccatc actaggggtt ccttgtagtt aatgattaac ccgccatgct 
25 acttatctac cgatgaattc gagcttgcat gc 

<210> 5 
<211> 300 
<212> DNA 

30 <213> Adeno-associated virus circular intermediate, clone p81 

<400> 5 

gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgcgcgc 
35 tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga gcccgggctt tgcccgggcg 180 
gcctcagtga gcgagcgcgc gcgcagagag ggagtggcca actccatcac taggggttcc 240 
ttgtagttaa tgattaaccc gccatgctac ttatctaccg atgaattcga gcttgcatgc 300 



60 
120 
180 
240 
272 



60 
120 



<210> 6 
40 <211> 272 
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3 

<212> DNA 

<213> Adeno-associated virus circular intermediate, clone pBl 
<400> 6 

5 gcatgcaagc tgtagataag tagcatggcg ggttaatcat taactacaag gaacccctag 
tgatggagtt ggccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa 
aggtcgcccg acgcccgggc tttggtcgcc cggcctcagt gagcgagcga gcgcgcagag 
agggagtggc caactccatc actaggggtt ccttgtagtt aatgattaac ccgccatgct 
acttatctac cgatgaattc gagcttgcat gc 

10 

<210> 7 
<211> 165 
<212> DNA 
<213> Unknown 

15 

<220> 

<223> SEQ ID KO:l of U.S. Patent No. 5,478,745 



<400> 7 

20 aggaacccct agtgatggag ttggccactc 
ccgggcgacc aaaggtcgcc cgacgcccgg 
gagcgcgcag agagggagtg gccaactcca 



cctctctgcg cgctcgctcg ctcactgagg 
gctttgcccg ggcggcctca gtgagcgagc 
tcactagggg ttcct 



THE CLAIMS DEFINING THE INVENTION ARE AS FOLLOWS: 



1. . A composition comprising: 

(a) a first adeno-associated virus vector comprising linked. 

i) a first DNA segment comprising a 5'-inverted terminal repeat 
of adeno-associated virus; 

ii) a second DNA segment comprising a portion of an open 
reading frame operably linked to a promoter; 

iii) a third DNA segment comprising a splice donor site; and 

iv) a fourth DNA segment comprising a 3 f -inverted terminal 
repeat of adeno-associated virus; and 

(b) a second adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a 5'-inverted terminal repeat 
of adeno-associated virus; 

ii) a second DNA segment comprising a splice acceptor site; 

iii) a third DNA segment comprising a portion of an open reading 
frame which together with the DNA segment of (a)ii) encodes 
a functional polypeptide; and 

iv) a fourth DNA segment comprising a 3'-inverted terminal 
repeat of adeno-associated virus. 

2. The composition of claim 1 further comprising a delivery vehicle. 

3. A method to express a polypeptide in a host cell comprising contacting the 
host cell with the composition of claim 1 so as to express the functional 
polypeptide. 
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A method to express a polypeptide in a host cell, comprising: contacting a 
host cell comprising 

(a) a first adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a S'-inverted terminal repeat 
of adeno-associated virus; 

ii) a second DNA segment comprising a portion of an open 
reading frame operably linked to a promoter; 

iii) a third DNA segment comprising a splice donor site; and 

iv) a fourth DNA segment comprising a 3'-inverted terminal 
repeat of adeno-associated virus; 

with a second adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a S'-inverted terminal repeat 
of adeno-associated virus; 

ii) a second DNA segment comprising a splice acceptor site; 

iii) a third DNA segment comprising a portion of an open reading 
frame which together with the DNA segment of (a)ii) encodes 
a functional polypeptide; and 

iv) a fourth DNA segment comprising a 3'-inverted terminal 
repeat of adeno-associated virus; 

so as to yield a host cell which expresses the functional polypeptide. 

A method to express a polypeptide in a host cell, comprising: contacting a 
host cell comprising 

(a) a first adeno-associated virus vector comprising linked: 

i) a first DNA segment comprising a 5'-inverted terminal repeat 
of adeno-associated virus; 

ii) a second DNA segment comprising a splice acceptor site; 

iii) a third DNA segment comprising a portion of an open reading 
frame for a polypeptide; and 

iv) a fourth DNA segment comprising a 3'-inverted terminal 
repeat of adeno-associated virus; 

with a second adeno-associated virus vector comprising linked: 
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i) a first DNA segment comprising a 5'-inverted terminal 
repeat of adeno-associated virus; 

ii) a second DNA segment comprising a splice donor site; 

iii) a third DNA segment comprising a portion of an open reading 
frame which together with the DNA segment of (a)iii) encodes 
a functional polypeptide; and 

iv) a fourth DNA segment comprising a 3 l -inverted terminal 
repeat of adeno-associated virus; 

so as to yield a host cell which expresses the functional polypeptide. 

A method to express a polypeptide in a host cell, comprising: contacting a 
host cell with a first adeno-associated virus vector comprising linked: 

(a) i) a first DNA segment comprising a 5-inverted terminal repeat 

of adeno-associated virus; 

ii) a second DNA segment comprising a portion of an open 
reading frame operably linked to a promoter; 

iii) a third DNA segment comprising a splice donor site; and 

iv) a fourth DNA segment comprising a 3'-inverted terminal 
repeat of adeno-associated virus; and 

a second adeno-associated virus vector comprising linked: 

(b) i) a first DNA segment comprising a S'-inverted terminal repeat 

of adeno-associated virus; 

ii) a second DNA segment comprising a splice acceptor site; 

iii) a third DNA segment comprising a portion of an open reading 
frame which together with the DNA segment of (a)ii) encodes 
a functional polypeptide; and 

iv) a fourth DNA segment comprising a 3-inverted terminal 
repeat of adeno-associated virus; 

so as to yield a host cell which expresses the functional polypeptide. 



The method of claim 3, 4, 5 or 6 wherein the host cell is a lung epithelial 
cell, a muscle cell or a neuron. 
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The method of claim 3, 4, 5 or 6 wherein the polypeptide is the CFTR 
polypeptide. 

A method of expressing a polypeptide in the muscle tissue of an animal, 
comprising contacting the muscle tissue with the composition of claim 1 
in an amount effective to express the polypeptide. 

A method of expressing a polypeptide in the airway epithelia of an 
animal, comprising contacting the airway epithelia with the composition of 
claim 1 in an amount effective to express the polypeptide. 

A method of expressing a gene product in the neurons of an animal, 
comprising contacting the neurons with the composition of claim 1 in an 
amount effective to express the polypeptide. 

The method of claim 3, 4, 5 or 6 wherein the polypeptide is /?-globin, y- 
globin, tyrosine hydroxylase, glucocerebrosidase, arylsulfatease A, factor 
VIII, dystrophin, or erythropoietin. 

The method of claim 3, 4, 5 or 6 wherein the host cell is a mammalian 
cell. 

The method of claim 3, 4, 5 or 6 wherein th^ host cell is selected from the 
group consisting of an avian cell, a bovine cell, a swine cell, an equine 
cell, an ovine cell, a canine cell, a feline cell, an amphibian cell, a 
reptilian cell and a fish cell. 

The method of claim 3, 4, 5 or 6 wherein the host cell is a brain cell, a 
retinal cell, a liver cell, a lung cell or a hematopoietic cell. 

A composition comprising: 

(a) a first adeno-associated virus vector comprising linked: 

(i) a first nucleic acid segment comprising a 5-inverted 
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terminal repeat of adeno-associated virus; 

(ii) a second nucleic acid segment comprising a portion of a 
gene which includes a transcriptional regulatory region; 

(iii) a third nucleic acid segment comprising a splice donor site; 
5 and 

(iv) a fourth nucleic acid segment comprising a 3'-inverted 
terminal repeat of adeno-associated virus; and 

(b) a second adeno-associated virus vector comprising linked: 

(i) a first nucleic acid segment comprising a S'-inverted 
10 terminal repeat of adeno-associated virus; 

(ii) a second nucleic acid segment comprising a splice 
acceptor site; 

(iii) a third nucleic acid segment comprising a portion of a gene 
which together with the nucleic acid segment of (a)(ii) 

15 comprises a gene comprising an open reading frame which 

encodes a functional polypeptide; and 

(iv) a fourth nucleic acid segment comprising a 3'-inverted 
terminal repeat of adeno-associated virus. 

20 17. The composition of claim 16 further comprising a delivery vehicle. 

18. The composition of claim 16 wherein the nucleic acid segment of (a)ii) 
comprises a promoter. 

25 19. The composition of claim 16 wherein the nucleic acid segment of (a)ii) 
comprises an enhancer. 

20. A method to express a polypeptide in a host cell comprising contacting the 
host cell with the composition of claim 16 so as to express the 
30 functional polypeptide. 




21 . A method to express a polypeptide in a host cell, comprising: contacting 
a host cell comprising a first adeno-associated virus vector comprising 
linked: 

(a) i) a first nucleic acid segment comprising a 5 -inverted 

terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a portion of a 
gene which includes a transcriptional regulatory region; 

iii) a third nucleic acid segment comprising a splice donor 
site; and 

iv) a fourth nucleic acid segment comprising a 3'-inverted 
terminal repeat of adeno-associated virus; 

with a second adeno-associated virus vector comprising linked: 

(b) i) a first nucleic acid segment comprising a 5-inverted 

terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a splice 
acceptor site; 

iii) a third nucleic acid segment comprising a portion of a 
gene which together with the nucleic acid segment of (a)ii) 
comprises a gene comprising an open reading frame which 
encodes a functional polypeptide; and 

iv) a fourth nucleic acid segment comprising a 3 -inverted 
terminal repeat of adeno-associated virus; 

so as to yield a host cell which expresses the functional polypeptide. 

22. A method to express a polypeptide in a host cell, comprising: contacting 
a host cell comprising a first adeno-associated virus vector comprising 
linked: 

(a) i) a first nucleic acid segment comprising a 5'-inverted 
terminal repeat of adeno-associated virus; 
ii) a second nucleic acid segment comprising a splice acceptor 
site; 
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iii) a third nucleic acid segment comprising a portion of a gene; 
and 

iv) a fourth nucleic acid segment comprising a 3'-inverted 
5 terminal repeat of adeno-associated virus; 

with a second adeno-associated virus vector comprising linked: 
(b) i) a first nucleic acid segment comprising a 5- 

inverted terminal repeat of adeno-associated virus; 
hh) a second nucleic acid segment comprising a 
10 portion of a gene which includes a transcriptional 

regulatory region and which together with the 
nucleic acid segment of (a)iii) comprises a gene 
comprising an open reading frame which encodes 
a functional polypeptide; 
15 hh) a third nucleic acid segment comprising a splice 

donor site; and 
ii) a fourth nucleic acid segment comprising a 3- 

inverted terminal repeat of adeno-associated virus; 
so as to yield a host cell which expresses the functional polypeptide. 

23. A method to express a polypeptide in a host cell, comprising: contacting 
a host cell with a first adeno-associated virus vector and a second 
adeno-associated virus vector, wherein the first adeno-associated virus 
vector comprises linked: 

(a) i) a first nucleic acid segment comprising a 5*-inverted 
terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a portion of a gene 
which includes a transcriptional regulatory region; 

iii) a third nucleic acid segment comprising a splice donor site; 
and 

iv) a fourth nucleic acid segment comprising a 3'-inverted 
terminal repeat of adeno-associated virus; 

wherein a second adeno-associated virus vector comprises linked: 
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b) (i) a first nucleic acid segment comprising a 5- 

inverted terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a splice acceptor 
site; 

iii) a third nucleic acid segment comprising a portion of a gene 
which together with the nucleic acid segment of (a)ii) 
comprises a gene comprising an open reading frame which 
encodes a functional polypeptide; and 

iv) a fourth nucleic acid segment comprising a 3'- inverted 
terminal repeat of adeno-associated virus; 

so as to yield a host cell which expresses the functional polypeptide. 

24. The method of claim 21 or 23 wherein the nucleic acid segment of 
(a)ii) comprises a promoter. 

25. The method of claim 21 or 23 wherein the nucleic acid segment of 
(a)ii) comprises an enhancer. 

26. The method of claim 22 wherein the nucleic acid segment of (b)ii) 
comprises a promoter. 

27. The method of claim 22 wherein the nucleic acid segment of (b)ii) 
comprises an enhancer. 

28. The method of claim 20, 21 , 22 or 23 wherein the host cell is a muscle 
cell, brain cell, retinal cell, liver cell, lung cell or hematopoietic cell 

29. The method of claim 20, 21 , 22 or 23 wherein the polypeptide is the 
cystic fibrosis transmembrane receptor, polypeptide, p-globin, y- 
globin, tyrosine hydroxylase, glucocerebrosidase, arylsulfatase A, 
factor VIII, dystrophin, or erythropoietin. 
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30. The method of claim 20, 21, 22 or 23 wherein the host cell is a 
mammalian cell. 

31. The method of claim 30 wherein the host cell is a muscle cell, brain 
cell, retinal cell, liver cell, lung cell or hematopoietic cell. 

32. The method of claim 20, 21 , 22 or 23 wherein the host cell is selected 
from the group consisting of an avian cell, a bovine cell, a swine cell, 
an equine cell, an ovine cell, a canine cell, a feline cell, an amphibian 
cell, a reptilian cell and a fish cell. 

33. The method of claim 32 wherein the host cell is a muscle cell, brain 
cell, retinal cell, liver cell, lung cell or hematopoietic cell. 

34. The method of claim 20, 21, 22 or 23 wherein the vectors comprise 
DNA. 

35. A composition comprising: 

a first adeno-associated virus vector comprising linked: 

(a) i) a first nucleic acid segment comprising a 5'-inverted 

terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a portion of a 
gene which includes a transcriptional regulatory region; 

iii) a third nucleic acid segment comprising a splice donor 
site; and 

iv) a fourth nucleic acid segment comprising a 3-inverted 
terminal repeat of adeno-associated virus; 

which first vector, in the presence of a second adeno-associated virus 
vector comprising linked: 

(b) i) a first nucleic acid segment comprising a S'-inverted 

terminal repeat of adeno-associated virus; 
ii) a second nucleic acid segment comprising a splice 
acceptor site; 
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iii) a third nucleic acid segment comprising a portion of a 
gene which together with the nucleic acid segment of(a)ii) 
comprises a gene comprising an open reading frame which 
encodes a functional polypeptide; 
iv) a fourth nucleic acid segment comprising a S'-inverted 
terminal repeat of adeno-associated virus; 
in a host cell yields a RNA transcript which comprises sequences 
from the first adend-associated virus vector linked to sequences from 
the second adeno-associated virus vector, which sequences are 
positioned so that the splice donor site is 5' to the splice acceptor 
site, and which transcript is spliced to a mRNA which encodes the 
functional protein. 

A composition comprising: 

a first adeno-associated virus vector comprising linked: 

(a) i) a first nucleic acid segment comprising a 5-inverted 

terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a splice 
acceptor site; 

iii) a third nucleic acid segment comprising a portion of a 
gene; and 

iv) a fourth nucleic acid segment comprising a 3-inverted 
terminal repeat of adeno-associated virus; 

which first vector, in the presence of a second adeno-associated virus 
vector comprising linked: 

(b) i) a first nucleic acid segment comprising a 5-inverted 

terminal repeat of adeno-associated virus; 

ii) a second nucleic acid segment comprising a portion of a gene 
which together with the nucleic acid segment of (a)iii) 
comprises a gene comprising an open reading frame which 
encodes a functional polypeptide, wherein the portion of the 
gene of (b)ii) includes a transcriptional regulatory region; 

iii) a third nucleic acid segment comprising a splice donor site; 
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iv) a fourth nucleic acid segment comprising a 3'-inverted terminal 

repeat of adeno-associated virus; 
in a host cell yields a RNA transcript which comprises sequences 
from the first adeno-associated virus vector linked to sequences 
5 from the second adeno-associated virus vector, which sequences 

are positioned so that the splice donor site is 5' to the splice 
acceptor site, and which transcript is spliced to a mRNA which 
encodes the functional protein. 

10 37. A composition according to any one of claim 1, 16, 35 or 36 

substantially as hereinbefore described with reference to any of the 
figures and/or examples. 
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38. 



A method according to any one of claims 3, 4, 5, 6, 20, 21, 22 or 23 
substantially as hereinbefore described with reference to any of the 
figures and/or examples. 
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